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DIAGNOSIS OF SHWACHMAN-DIAMOND SYNDROME 

Field of the Invention 

The invention relates to methods for diagnosing and treating individuals 
with Shwachman-Diamond Syndrome and for detecting Shwachman-Diamond 
disease carriers. More specifically, the invention relates to the identification of 
the Shwachman-Bodian-Diamond Syndrome (SBDS) gene and the 
identification of mutations of this gene which are associated with Shwachman- 
Diamond Syndrome. 

Background of the Invention 

Shwachman-Diamond Syndrome (SDS [MIM 260400]) is an autosomal 
recessive disorder with clinical features including exocrine pancreatic 
insufficiency, haematological dysfunction, and skeletal abnormalities 1,2,3 . 
Patients with SDS have a high risk of bone marrow failure and are at risk of 
developing acute myelogenous leukaemia (AML). SDS is the second most 
common cause of pancreatic insufficiency after cystic fibrosis and involves the 
failure of development of the exocrine pancreas. Other manifestations include 
skeletal abnormalities and liver function abnormalities, the latter being notable 
in young patients. 

Many SDS patients present with malabsorption and steatorrhea related 
to their pancreatic insufficiency. Many such children fail to thrive due to the 
malabsorption and also due to their disinclination to eat normally because of 
gastrointestinal upsets. The haematological dysfunction most consistently 
involves neutropenia but can also present as thrombocytopenia or 
pancytopenia. Serious consequences for SDS patients include recurring 
severe infections that can be life threatening if the diagnosis is not made with 
the provision of prompt treatments. Further, traditional methods for treatment 
of bone marrow failure are generally not successful in SDS patients at this 
time but the surveillance and monitoring of the bone marrow to determine the 
occurrence of myelodysplasia, aplastic anaemia and/or the development of 
AML do provide some options for intervention. 
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It is therefore important for the optimum development and overall long 
term prognosis of these children that they are diagnosed as having SDS as . 
early as possible s© that infections may be treated with appropriate 
interventions, so that blood and bone marrow can be monitored for cellularity 
(numbers and cell types) and so that pancreatic enzyme supplementation 
may be instituted to provide adequate or near normal food absorption. 

There are other diseases associated with exocrine pancreatic 
dysfunction, such as Cystic Fibrosis and Pearson Marrow Syndrome, and 
other diseases such as congenital neutropenia, Blackfan-Diamond Syndrome 
and Fanconi Anaemia can mimic the haematological manifestations of SDS. 
It is important, for proper treatment, that SDS is diagnosed as early as 
possible but at present SDS can only be distinguished from other diseases 
causing similar symptoms by complex, symptom-based tests which may have 
to be repeated many times before a conclusion is reached (Rothbaum et al., 
(2002), J. Pediatrics, v. 141, pp. 266-270; Ginzberg et al., (2000), Am. J. 
Hum. Genet, v. 66, pp. 1413-1416). 

There is therefore a real need for a convenient and definitive test, such 
as a genetic test or a gene product-based immunological test, to diagnose 
SDS. Further, as the bone marrow failure aspects are so serious, there is 
need to provide new options to correct the associated deficiencies. The 
identification and analysis of the gene that is affected in SDS would provide 
for such opportunities. 

Segregation analysis of an international collection of families of SDS 
patients supports an autosomal recessive mode of inheritance (Ginzberg et 
al., (2000), Am. J. Hum. Genet., v. 66, pp. 1413-1416). Previous studies of 
families with SDS showed that the putative SDS locus mapped to the 
centromeric region of chromosome 7, to a 1 .9 cM interval at 7q1 1 4,5 . The 
genetic defect associated with the disease has, however, not previously been 
identified. 

Summary of the Invention 
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The invention provides a convenient and rapid method for the 
diagnosis of SDS, based on the finding that SDS is associated with mutations 
in a previously uncharacterised gene residing within the 1 .9 centiMorgan 
disease interval at 7q11 delineated by linkage and haplotype analysis in 
family studies 4 * 5 . The gene, with a 1 .6 kb transcript, was originally designated 
by the inventors as DEPGH and its encoded protein of 250 amino acids was 
designated depechin. The gene has been renamed as Shwachman-Bodian- 
Diamond Syndrome (SBDS) gene. A second copy previously designated 
DEPCHP and now designated SBDSP, with 97% nucleotide sequence 
identity, resides within a locally duplicated genomic block of at least 305 kb, 
and appears to be a pseudogene. Recurring mutations, the apparent result of 
recombination between the duplicated gene copies, were found in 89% of 
unrelated SDS patients (n=158), with 60% carrying two converted alleles and 
29% having a different mutation in the second allele. The extent of the 
converted segments varied but consistently included at least one of two 
critical sequence changes predicted to result in truncation of the encoded 
protein. Other less common disease alleles involve missense and 
insertion/deletion changes distinct from those in the pseudogene. The gene is 
a member of a highly conserved protein family, with putative orthologues in 
diverse species ranging from archaebacteria to eukaryotes. The archaeal 
orthologues are located within highly conserved operons that include 
homologues of genes involved in RNA processing 6 , suggesting that SDS may 
be the result of a deficiency in some aspect of RNA metabolism that is 
essential for haematopoiesis, chondrogenesis and the development of the 
exocrine pancreas. 

"SBDS or SBDS gene" is the chromosome 7q1 1 .22 gene as described 
herein which when mutated is associated with SDS. This definition includes 
sequence polymorphisms wherein the nucleotide substitutions in the gene 
sequence do not affect the function of the gene product. 

W SBDS protein" is the protein encoded by the SBDS gene. 

"Mutant SBDS gene" is the SBDS gene containing one or more 
mutations which, if present on both alleles of the gene, lead to SDS. 
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In accordance with one embodiment, the invention provides a method 
for determining whether a subject is suffering from Schwachman-Diamond 
Syndrome (SDS) comprising 

obtaining a nucleic acid sample from the subject, and 

conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SBDS gene mutation associated with SDS in both SBDS 
alleles indicates that the subject suffers from SDS. 

In accordance with a further embodiment, the invention provides a 
method for determining whether a subject is an SDS carrier comprising 

obtaining a nucleic acid sample from the subject, and 

conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SBDS gene mutation associated with SDS in one SBDS 
allele indicates that the subject is an SDS carrier. 

In accordance with a further embodiment, the invention provides a 
method for determining whether a subject is suffering from Shwachman- 
Diamond Syndrome (SDS) comprising 

obtaining a tissue sample from the subject, and 

conducting an assay on the tissue sample to determine the level of 
SBDS protein in the sample, wherein a reduced level of SBDS protein in the 
sample relative to a control sample indicates that the subject suffers from 
SDS. 

In accordance with a further embodiment, the invention provides a 
method for determining whether a subject is at risk for developing acute 
myelogenous leukaemia (AML) comprising 

obtaining a nucleic acid sample from the subject, and 
conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SfiDS gene mutation associated with SDS indicates that 
the subject is at risk for development of AML. 
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In accordance with a further embodiment, the invention provides a 
method for treating a subject suffering from SDS comprising administering to 
the subject a therapeutically effective amount of a substantially purified SBDS 
protein or of an isolated nucleotide sequence encoding an SBDS protein. 

In accordance with a further embodiment, the invention provides an 
isolated nucleic acid molecule encoding an SBDS protein. 

In accordance with a further embodiment, the invention provides an 
isolated nucleic acid molecule comprising at least about 10, 20, 30, 50, 75 or 
100 consecutive nucleotides of SEQ ID NO:1 or 29. 

In accordance with a further embodiment, the invention provides a 
substantially purified SBDS protein. 

In accordance with a further embodiment, the invention provides an 
antibody which binds specifically to an epitope of an SDS protein. 

In accordance with a further embodiment, the invention provides a 
nucleotide sequence selected from the group consisting of: 

(a) 5'-GCGTAAAAAGCCACAATAC-3' (SEQ ID NO:3); 

(b) 5'-CTATGACAGTATTCGTAAGACTAGG-3' (SEQ ID NO:4); 

(c) 5*-GGGGATTTGTTGTGTCTTG-3' (SEQ ID NO:5); 

(d) 5'-CTTTCCTCCAGAAAAACAGC-3 , (SEQ ID,NO:6); 

(e) 5'-AAATGGTAAGGCAAATACGG-3' (SEQ ID NO:7); 

(f) 5'-ACCAAGTTCTTTATTATTAGAAGTGAC-3' (SEQ ID NO:8); 

(g) 5*-GCTCAAACCATTACTTACATATTGA-3' (SEQ ID NO:9); 

(h) 5'-CACTTGCTTCCATGCAGA-3' (SEQ ID NO:10); 

(i) 5'-AAAGGGTCATTTTAACACTTC-3' (SEQ ID NO:1 1 ); 

(j) 5'-GAAAATATCTGACGTTTACAACA-3' (SEQ ID NO:12); 
(k) 5'-TCCACTGTAGATGTGAACTAACTC^3' (SEQ ID NO:1 3); 
(I) 5'-CACTCTGGACTTTGCATCTT-3' (SEQ ID NO: 1 4); 
(rh) 5'-GCTTCTGCTCCACCTGAC-3'(SEQIDNO:15); 
(n) 5'AGCTATGCTGCAGCTGTTAC-3' (SEQ ID NO:16); 
(o) 5'-ATGCATGTCCAAGTTTCAAG-3' (SEQ ID NO:1 7); 
(p) 5'-TCCATGGCTATATTTTGATGA-3 (SEQ ID NO:1 8); 
(q) 5'-TAAGCCTGCCAGACACAC-3' (SEQ ID NO:1 9); 



WO 2004/020658 



PCT/CA2003/001320 



6 

(r) 5'-CACTCTGGACTTTGCATCTT-3' (SEQ ID NO:20); 

(s) 5'-TGTTGGTTTTCACCGAATA-3' (SEQ ID NO:21); 

(t) 5-AGATAAAGAAAGACACACACAACT-3' (SEQ ID NO:22); 

(u) 5'-GAAATCGCCTGCTACAAA-3' (SEQ ID NO:23); 

(v) 5'-TCAGCTTCTTGCCTTCAT-3' (SEQ ID NO:24); 

(w) 5'-TAAGTAAGCCTGCCAGACA-3' (SEQ ID NO:25); 

(x) 5'-CATCAAGGTCI 1 1 1 ICCAAG-3' (SEQ ID NO:26); 

(y) 5'-CCTGTCTCTGCCCAAGTC-3' (SEQ ID NO:27); and 

(z) 5'-AGGGAACATTTTCAAAACTCA-3' (SEQ ID NO:28). 

In accordance with a further embodiment, the invention provides a 
transgenic non-human mammal having within its genome an SBDS gene with 
at least one mutation associated with SDS. 

In accordance with a further embodiment, the invention provides a kit 
comprising at least one pair of primers suitable for amplification of at least a 
portion of an SBDS gene. 

Summary of Drawings 

Fig. 1 shows an integrated map of the interval of chromosome 7 where 
the gene deficiency that leads to SDS resides, a, The refined map interval, 
flanked by microsatellite markers D7S2429 and D7S502, is shown with 
reference to the Genbridge 3 radiation hybrid panel, b, An expanded map of 
sub regions from RH bins 65 and 72 based on genomic sequences from BAC 
clones in GenBank. The regions contains at least 305 kb that has duplicated 
intrachromosomally. The positions and orientations of the paralogous 
duplicons along 7q were determined by unique STS content and radiation 
hybrid mapping, c, Identified genes in the BAC contigs are shown. Duplicon 
A contains at least 2 genes, SBDS and SDCR2A (Shwachman-Diamond 
Critical Region-2A). d, SBDS is composed of 5 exons (coding regions in grey, 
noncoding regions in black) spanning 7.9 kb of genomic sequence. The 
location of oligonucleotide primers used for mutation screening by genomic 
PCR and RT-PCR are indicated. 
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Fig. 2 shows mutations in SBDS associated with SDS. a, Map of 
SBDS (coding regions in light blue) and sequence alignment of the exon 2 
region of SBDS and SBDSP, with gene-specific sequences in green and . 
pseudogene sequences in red. In comparison to SBDS, SBDSP exon 2 
contains sequence changes (underlined in red) that are predicted to result in 
truncation of its predicted protein product. These include an in-frame stop 
codon at 184 bp and a T>C change at 250+10 bp (corresponding to the 
invariant T of the donor splice site at 258+2 bp in SBDS) which results in the 
use of an alternate donor splice site (invariant splice site positions are boxed) 
at 250+1 bp. The sequence differences in SBDSP present restriction sites for 
Bsu36\ 9 and Dcfel at 183 bp and Cac8\ at 240+7 bp. b, Electropherograms for 
cloned sequences from the exon 2 region of SBDS reveal sequence changes 
(red) derived from gene conversion events between SBDS and its 
pseudogene; three gene converted alleles are shown. These include 
[183TA>CTJ t [258+2T>C], and an extended conversion mutation [183TA>CT 
+201 A>G +258+2T>C] with the intervening adenine (position 201) to guanine 
change. In each case, flanking sequences, including those at 129-2 bp and 
258+124 bp, have not been converted (green), c, A restriction map of the 
SBDS exon 2 amplimer (primers E and F, Fig. 1d) showing the position of 
CacSI (C) and Bsu36l (B) restriction sites. Square brackets indicate the 
positions of restriction sites corresponding to converted sequences. The 
pedigree of family SW20 is shown with affected individuals in black and 
carriers in grey. Restriction fragment analysis of PCR amplified SBDS exon 2 
sequences revealed that the brothers inherited [183TA>CT] through the father 
and paternal grandfather, and [258+2T>C] through the mother and maternal 
grandmother. Patient P1 is heterozygous for [258+2T>C] and the extended 
conversion mutation (J183TA>CT +201 A>G +258+2T>CJ). Two unrelated 
control individuals are also shown (C1 and C2). cf, Restriction maps of the 
gene and pseudogene loci showing the locations of all Nde\ restriction sites 
(N). Hybridisation of a DNA probe derived from a partial SBDS cDNA (green) 
to genomic DNAs restriction digested with Mfel indicates that members of 
family SW6 (including patient P1 with two converted alleles) show a pattern of 
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hybridisation similar to two unrelated control individuals (C3 and C4) 
indicating that no rearrangements or deletions have occurred in the vicinity of 
SBDS or SBDSP. e, Sequence traces depicting other representative coding 
mutations in patient SBDS compared to controls (N), including an insertion 
([96_97insA]), a deletion {[119delG]) and two missense mutations {[240A] 
and [505OT]). 

Fig. 3 shows expression analysis of SBDS and SBDSP. FTh Fetal 
thymus, FSp Fetal spleen, FLi Fetal Liver, FK Fetal kidney, FSM Fetal skeletal 
muscle, FLu Fetal lung, FH Fetal heart, FB Fetal brain, K Kidney, SM Skeletal 
muscle, Lu Lung, H Heart, B Brain, Li Liver, PI Placenta, Pa Pancreas, Th 
Thymus, Sp Spleen, Ly Lymphocytes, To Tonsil, BM Bone Marrow, Le 
Peripheral Blood Leukocytes, LN Lymph Node, GAPDH Glyceraldehyde-3- 
Phosphate Dehydrogenase, a, RNA expression survey of SBDS and SBDSP 
in primary tissues using a cloned RT-PCR product containing the entire SBDS 
open reading frame (primers T and R). Cumulative levels of both gene and 
pseudogene transcripts appear to be lower in thymus and bone marrow. An 
alternatively spliced product was detected in several tissues and was most 
prominent in peripheral blood leukocytes (Le). As shown in the lane indicated 
with an asterisk, this large transcript was detected with a probe derived from 
intron 1. b, Analysis of patient EBV-transformed B lymphoblastoid-derived 
RNA shows that SBDS and SBDSP cumulative expression is lower in some 
patients compared to a control individual (C). The probe used to provide a 
control for RNA loading consisted of a 983bp cloned cDNA fragment from 
glyceraldehyde 3-phosphate dehydrogenase (GAPDH). c, RT-PCR 
expression analysis of SBDS and SBDSP was carried out with specific 
oligonuleotide primers arid indicated that both transcripts are widely 
expressed. Sequencing of PCR products led to the identification of an exon 
2 minus transcript. RT-PCR indicated that the alternatively spliced product 
(shown as 349bp) is present in all tissues tested, however its expression is 
significantly lower than transcripts that include exon 2 (shown as 479bp). 

Fig. 4 shows CLUSTALX alignment of SBDS-encoded protein, SBDS, 
and representative orthologues. Strong conservation is seen throughout the 



WO 2004/020658 



PCT/CA2003/001320 



9 

alignment from archaebacteria to complex eukaryotes. represents 
absolutely conserved residues in the alignment, 7 represents positions at 
which conservative amino acid substitutions are observed and 7 represents 
semi conservative substitutions. The degree of sequence similarity is less 
pronounced towards the C-terminus although subgroups retain strong 
conservation. The human amino acid sequence (Hsa) is shown in bold. The 
locations of all identified coding mutations are represented as white letters on 
a black background and corresponding amino acid sequence changes are 
shown above the alignment. A putative U1-like zinc finger domain in three 
plant orthologues is indicated with a black bar. Ath Arabidopsis thaliana, Dme 
Drosophila melanogaster, Cel Caenorhabditis etegans, Mmu Mus musculus, 
Hsa Homo sapiens, Ola Oryzias latipes, See Saccharomyces cerevisiae, Ecu 
Encephalitozoon cuniculi, Mac Methanosarcina acetivorans str. C2A, Hnr 
Halobacterium sp. NRC-1 , Mka Methanopyrus kandleri str. AV19, Mja 
Methanococcus jannaschii, Afu Archaeoglobus fulgidus, Pab Pyrococcus 
abyssi, Tac Thermoplasma acidophilum, Pae Pyrobaculum aerophilum, Sso 
Sulfolobus solfataricus, Ape Aeropyrum pemix, Pba Populus balsamifera, Gar 
Gossypium arboreum, + derived from partial GenBank EST sequence. 

Fig. 5 shows the SBDS cDNA and. its predicted encoded polypeptide. 
A: The nucleotide sequence of the cDNA corresponding to SBDS mRNA is 
shown numbered with the +1 starting at the first nucleotide, A, of the 
translation initiating codon. The 5' and 3' untranslated regions are shown in 
lower case, and the coding segment is shown in upper case text. B: amino 
acid sequence of the encoded polypeptide of 250 amino acids is shown 
numbered. 

Fig. 6 shows the aligned genomic sequence for the human SBDS gene 
(SBDS) and its pseudogene SBDSP (SBDSP) and for the mouse SBDS gene 
(MUSBDS). The sequences for the five human exons are included with 
numbering that corresponds to that indicated in Fig. 5A. SBDS specific 
oligonucleotide primers that can be used to determine the nucleotide 
sequence of expressed RNA or of each of the exons for mutation detection 
are indicated by underlining of the SBDS sequence. Dual specific 
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oligonucleotide primers are indicated by the underlining of both SBDS and 
SBDSP sequences. The sequence of oligonucleotide primers indicated in the 
forward direction (the arrows pointing to the right) correspond directly to the 
sequence shown, while those primers in the reverse direction (the arrows 
pointing to the left) are comprised of the reverse complement of the indicated 
sequence. 

Figure 7 shows the specificity and reactivity of antibodies produced to 
detect the SBDS protein, a, Polyclonal antibodies produced with recombinant 
SBDS (anti-rSBDS), left panel or a carboxyl peptide (anti-CpSBDS) of amino 
acids 224-239 (aa^lKKETKGKGSLEVLNL 239 ) of SBDS, right panel, detected 
single bands of the predicted size in whole cell extracts of induced host E. coli 
BL21 containing the pET-28a expression vector with an in-frame fusion of the 
entire SBDS open reading frame. A polyclonal antibody to an amino peptide 
(anti-NpSBDS) of amino acids 32-47 (aa 32 CYKNKWGWRSGVEKD 47 ) of 
SBDS has also been generated, data not shown, b, The anti-rSBDS antibody 
also detected SBDS expressed transiently in HEK293 cells under the control 
of a CMV promoter. The bands corresponds to those detected by anti-Myc or 
anti-HA antibodies. The subtle shifts in sizes are due to the various epitope 
tags and/or their locations that have been fused in frame to the SBDS gene, 
including amino or carboxyl positioned Myc (N-Myc or C-Myc) N-HA or amino 
or carboxyl positioned HA (N-HA or C-HA) tags, c, Anti-rSBDS also detected 
a prominent band in whole cell extracts of the predicted size for SBDS in 
BxPC3 (ATCC CRL-1687), SV40-transformed human fibroblasts (GM00639), 
Caco-2 (ATCC HTB-37), AR42J (ATCC CRL-1492), EBV transformed human 
lymphoblast (GM003798), PANC1 (ATCC CRL-1469) and J.RT3 (ATCC TIB- 
153) cell lines. The total protein loaded per extract is as indicated below each 
panel. 

Detailed Description of the Invention 

The inventors have identified the SBDS gene and described the 
association of mutations in that gene with the autosomal recessive disease, 
SDS. 
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Clinical presentation in SDS can be variable but family studies have 
supported a single gene locus near the centromere at 7q1 1 2,4,5 . Eighteen 
positional candidate genes were identified in compiled genomic sequences 
from the locus, and eight of these were analysed for mutations in members of 
linked families. Disease-associated changes were identified in a gene 
represented by the full length, 1.6 kb cDNA clone flj10917 (OVARC1 000321). 
The gene was initially designated by the inventors as DEPCH (Development 
of Exocrine Pancreas, Chondrocytes and Haematological lineages). The gene 
has been renamed (as approved by the Human Genome Organisation Gene 
Nomenclature Committee) as Shwachman-Bodian-Diamond Syndrome 
(SBDS) gene. The cDNA sequence is given in Fig. 5A (SEQ ID NO:1). 
SBDS is composed of 5 exons spanning 7.9 kb, and is contained in BAC 
clone RP11-325K1. The nucleotide sequences of the exons and surrounding 
introns are given in Fig. 6. The sequence of murine SBDS is also shown in 
Fig. 6. SBDS and part of an adjacent gene reside in a block of genomic 
sequence of at least 305 kb that is locally duplicated (Fig. 1 ). The paralogous 
duplicon was mapped distally, and contains an unprocessed pseudogene 
copy of SBDS t named SBDSP. The pseudogene transcript is 97% identical to 
the SBDS transcript with small deletions and single nucleotide changes that 
clearly disrupt coding potential. 

The protein product encoded by SBDS, termed SBDS, is a member of 
a highly conserved protein family (Pfam UPF00023) 20 . Orthologues exist in 
species ranging from archaebacteria to vertebrates and plants (Fig. 4). The 
sequence of 250 amino acids is given in Fig. 5B (SEQ ID NO:2) for a 
predicted polypeptide of 28.8kDa with a pi of 8.9. The predicted amino acid 
sequence has no homology to any known functional domain, and no signal 
peptides were detected. The S. cerevisiae orthologue, encoded by ORF 
YLR022c, has been found to bind specifically and with high affinity to the 
phospholipids PI(4,5)P2 and PI(4)P using yeast proteome chips 21 . The gene 
has also been deleted by the Yeast ORF Deletion Project and haploid spores 
lacking YRL022c were found to be inviable 22 . Indirect lines of evidence 
suggest that orthologues of SBDS may play a role in RNA metabolism. First, 
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YLR022c has been clustered with other genes encoding RNA processing 
enzymes based on microarray expression profile analysis 23 . In addition, 
SBDS archael orthologues are located in conserved operons that contain 
several RNA processing genes, including homologues of subunits of the 
eukaryotic exosome and RNaseP complexes 8 . The A. thaliana orthologue, 
along with sequences derived from partial cDNAs from P. balsamifera and G. 
arboreum, have extended carboxyl termini corresponding to putative RNA- 
binding domains, suggesting a functionally relevant fusion in flowering plants 
(Fig. 4). These observations suggest that SDS may be the result of a defect 
in an RNA processing pathway. Manifestation of disease must reflect the loss 
or perturbation of a cellular function that is particularly critical for the 
development of pancreatic acini, myeloid lineages, and chrondrocytes at 
growth plates of bones. The associated symptoms and the complications due 
to bone marrow failure may reflect not only the loss of one gene but also 
pleiotropic consequences of an aberrant pathway. 

Sequence changes that do not alter protein-associated activities and 
that occur in normal individuals are likely to correspond to gene 
polymorphisms. A current accepted standard to discriminate polymorphisms 
from mutations is to screen 100 individuals of comparable ethnic background 
that are not affected with SDS. Examples of polymorphisms detected in 
SBDS are given in Table 2. SDS-associated mutations are shown in Table 1 . 

Diagnostic Methods 

The invention provides a diagnostic method for determining whether a 
subject, such as a human subject, suffers from, or is at risk of developing, 
symptoms of SDS. In one embodiment, the method involves examining a 
nucleic acid sample from the subject for the presence or absence of a 
mutation of the SBDS gene associated with SDS. Such mutations include 
183J84TA->CT; 183J84TA->CT+258+2T-»C; 258+2T->C; 24C-+A; 96- 
97insA; 119 deIG; 131A-*G; 199A->G; 258+1G^C; 260T-+G; 291- 
293delTAAinsAGTTCAAGTATC; 377G-+C; 505C-+T; 56G->A; 93C->G; 
97A-*G; 101A-*T; 123delC; 279_284delTCAACT; 296_299delAAGA; 
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354A-*C; 428C-»T+443A-*G; 458A->G; 460-1G-»A; 506G->C; and 
624+1 G->C. These mutations are identified in relation to the numbering of 
the nucleotide sequence of SEQ ID NO:1. 

Many methods known to those of skill in the art can be used to detect 
the presence or absence of a SBDS gene mutation in the subject's nucleic 
acid. 

The cDNA sequence of the wild type SBDS gene is shown in Figure 5 
and is available at GenBank Accession Number AY1 69963 (NM_016038). 
The exon structure and flanking intron sequences are shown in Figure 6. 

"Mutations" of the wild type SBDS gene associated with SDS include 
conversions, deletions, insertions, inversions or point mutations, either in the 
coding regions of the gene or gene regulatory regions. 

A number of types of assay may be used to determine whether a 
subject has an SBDS gene mutation associated with SDS, including; for 
example, sequencing exons or other portions of the gene, including regulatory 
or intronic segments, PCR-RFLP analysis, allele specific PCR, allele specific 
oligonucleotide hybridisation restriction fragment length polymorphism (RFLP) 
analysis. 

Where a direct sequencing assay is used, the sample may be DNA or 
RNA, for example genomic DNA or mRNA. Gene-controlling DNA segments 
and exons of an individual can be amplified and then examined for direct 
sequence changes, or scanned with methods that detect a heterozygous state 
followed by sequencing. These latter scanning methods can include single 
stranded conformational analysis (Orita M, Iwahana H, Kanazawa H, Hayashi 
K and Seklya T (1989), "Detections of polymorphisms of human DNA by gel 
electrophoresis as single-stranded conformation polymorphisms", Proc. Natl. 
Acad. Sci, USA 86: 2776-2770), denaturing gradient gel electrophoresis 
(Wartell RM, Hosseini SH and Moran CP Jr (1990), "Detecting base pair 
substitutions in DNA fragments by temperature-gradient gel electrophoresis", 
(Nucleic Acids Res. 18: 2699-2705; Sheffield VC, Cox DR, Lerman LS and 
Myers RM (1989) or "Attachment of a 40-base-pair G + C rich sequence (GC 
clamp) to genomic DNA fragments by the polymerase chain reaction results in 
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improved detection of single-base changes" (Proc. Natl. Acad. Sci, USA 86: 
232-236); and denaturing high pressure liquid chromatography Cotton RGH, 
Edkins E, Forrest S (eds) 1998 "Mutation detection: a Practical Approach" IRL 
Press, Oxford, and heteroduplex analysis Keen J, Lester D, Ingleheam C, 
Curtis A, Bhattacharya S (1991) Rapid detection of single base mismatches 
as heteroduplexes on Hydrolink gels. Trends Genet, 7:5, amongst other 
methods. Larger deletions or insertions can be detected by traditional 
Southern blot analysis of DNA digest with restriction enzymes (Southern EM. 
(1975) 'Detection of specific sequences among DNA fragments separated by 
gel electrophoresis', J Mol Biol 98:503-1 7). Mutant alleles can be 
distinguished by observing their inheritance from each parent and although 
each patient will have two affected alleles, they will typically appear in 
heterozygous state (all of the references of this paragraph are incorporated 
herein by reference). 

The diagnostic methods of the invention are used to screen subjects 
showing symptoms of possible SDS, such as pancreatic insufficiency to 
identify SDS, or to screen relatives of known SDS cases to determine whether 
they may be at risk of developing SDS symptoms. 

The diagnostic method of the invention should preferably be carried out 
on samples from children at a young age in order to establish the diagnosis 
and allow appropriate treatment. The diagnostic method may also be used as 
a prenatal test, using amniotic fluid or CVS samples. 

With respect to determining carrier status, as discussed below, the test 
may be carried out at any age, preferably at an age greater than 1 6 years in 
relatives of SDS patients. 

Signs of SDS generally are evident in children at an early age and the 
diagnostic methods of the invention will usually be employed to determine if a 
child presenting with SDS symptoms is indeed suffering from SDS. On 
occasion, a sibling or close relative may be screened to determine if he or she 
suffers from SDS. 

Suitable samples for testing of nucleic acid include buccal swabs, 
blood samples and bone marrow aspirates. 
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In one embodiment, genomic DNA is extracted from the sample and a 
target portion of the genomic DNA comprising the SBDS gene or a selected 
portion thereof is amplified by a polymerase chain reaction using suitable 
oligonucleotide primers, such as those described herein. The amplified 
nucleic acid is then sequenced using conventional techniques. The sequence 
is compared with the wild type sequence to determine the presence or 
absence of SDS-associated mutations. Primers must be selected which will 
amplify only the SBDS gene and not the pseudogene, as shown in Figure 6. 
Since a larger number of SDS-associated mutations have been observed in 
exon 2 of SBDS gene, it is preferable to look first for mutations in that exon. If 
no mutations are found in exon 2, exons 1 and 3 to 5 are similarly examined 
in turn. 

One of skill in the art can select suitable primers by reference to the 
SBDS sequence of Figure 6, suitable primers are also identified in Example 1 . 
Preferred primer pairs for amplification of SBDS exons are as follows: 

Exonl: A&BorQ&B; 

Exon 2: E & F; 

Exon 3: G & H; 

Exon 4: SDCR9x4seqB; 

(5' - GCCTTCACTTTCTTCATAGT - 3') & J; and 

Exon 5: SDCR9x5Fseq 

(5' - GCTTGCCTCAAAGGAAGTT - 3') & L. 

Regulatory regions of SBDS, such as the promoter region, may also be 
examined using suitable primers. 

Promoter primers include SDCR9prom1RA (5' - 
CAGCCGACGACCTTGTTTT - 3') and SDCR9prom6FA (5' - 
GTGCCAACGCTGTGTTTT - 3"). 

These primers amplify a 501 bp segment partially overlapping exon 1 , 
which likely contains the major controlling elements for the transcription of 
SBDS mRNA. 

For conversion mutations found in exon 2, examination of the test 
subject's parents can be used to distinguish whether the subject has two 
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conversion recombinations rather than one extended conversion 
recombination. 

In a further embodiment of the invention, an RNA sample is obtained 
from the test subject and is reverse transcribed by conventional methods to 
give a corresponding cDNA which is amplified by PGR and sequenced. 

In a further embodiment, RFLP analysis may be used to detect SBDS 
gene mutations. Such methods of analysis are well known to those of skill in 
the art and an example is described in the Examples herein and in reference 
30. Test samples are compared with normal controls and samples from 
patients with known mutations. 

In a further embodiment, analysis of SBDS expression or of the level of 
SBDS protein may used to determine whether a subject suffers from or is at 
risk of SDS. As described herein, SBDS is expressed in a wide variety of 
tissues, including the most disease-relevant tissues, pancreas, bone marrow 
and myeloid cell lineages. A blood or tissue sample may therefore be used to 
evaluate SBDS expression or SBDS protein level. As seen in Figure 3b, 
mRNA level is notably reduced in SDS patients. SBDS expression can be 
evaluated by many routine methods, for example by mRNA analysis as 
described in the Examples herein and in reference 30. 

In a further embodiment, an antibody specific for SBDS protein and 
carrying a detectable label can be used to assess the level of SBDS protein in 
a tissue sample of a subject by an immunological technique. Many suitable 
techniques, such as immunoprecipitation or ELISA assays, are known to 
those of skill in the art and are described, for example, in "Using antibodies - a 
laboratory manual", (1999), Harlow et al., Cold Spring Harbor Lab. Press. 
The level of protein in a test subject is compared with that in similar tissue 
samples from unaffected individuals, a reduction in level of SBDS protein 
being indicative of SDS. The identification of the SBDS gene and the 
absence of any known closely related homologues enables the preparation of 
antibodies highly specific for SBDS protein. 

Detection of SDS Carriers 
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The invention further provides a method for determining whether a 
subject is an SDS carrier by determining whether the subject has an SDS- 
associated mutation in one allele of the SBDS gene. 

The methods described above for detecting an SDS-associafed 
mutation in a sample from a subject suspected of suffering from SDS may 
also be applied to detect carriers of the disease. The described methods for 
detecting such mutations in a nucleic acid sample from a subject are 
preferred. 

Screening for SDS carriers is earned out especially on members of 
families with known SDS cases and may be important for genetic counselling 
of such family members regarding their likelihood of passing the disease on to 
their children. Generally, a method would be used to look for a specific 
mutation already found in an affected family member. 

Identification of Further Mutations 

The present invention also enables the identification of additional SDS- 
associated mutations of the SBDS gene, for example by examining SDS 
patients using the methods and primers described herein. 

.. Amplification of target portions of the gene, followed by direct nucleic 
acid sequencing, as described herein for diagnostic purposes, and 
comparison with the wild type sequence, may be used to identify additional 
SDS-associated mutations. 

Alternatively, assessment of the expression level of the SBDS gene, as 
described herein, may indicate reduced expression levels and point to further 
mutations which can be characterised by nucleic acid analysis as described 
above. 

Nucleic Acids 

The invention provides SBDS nucleic acids and homologues and 
portions thereof. Preferred nucleic acids have a nucleotide sequence which is 
at least 80%, preferably at least 90% and more preferably more than 97% 
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homologous to the nucleotide sequence of SEQ ID NO:1 or SEQ ID NO:29 or 
to a complement thereof. 

Preferred nucleic acids are mammalian and especially preferred are 
human nucleic acids. Nucleic acids of the invention include nucleic acids 
encoding an amino acid sequence with at least 75%, preferably at least 90% 
and more preferably at least 99% amino acid identity to the amino acid 
sequence of SEQ ID NO:2, and nucleic acids encoding a portion of such 
amino acid sequences. 

Also within the scope of the invention are nucleic acid molecules useful 
as probes or primers and comprising at least about 10, 20, 30, 50, 75, 90 or 
100 consecutive nucleotides of SEQ ID NO:1. 

Also within the scope of the invention are nucleic acids which hybridise 
under stringent conditions to a nucleic acid of the nucleotide sequence SEQ 
ID NO:1 or to a complement or a portion thereof. Stringent conditions for 
nucleic acid hybridisation are known to those skilled in the art and are 
described, for example, in "Protocols in Molecular Biology", (1989), John 
Wiley & Sons, N.Y., at 6.3.1 to 6.3.6. 

Also within the scope of the invention are nucleic acids which differ 
from the sequence of SEQ ID NO:1 due to the degeneracy of the genetic 
code. 

Proteins 

The invention provides substantially purified SBDS proteins and 
portions thereof. These proteins and portions thereof are useful for the 
preparation of antibodies specific for SBDS proteins. 

"Substantially purified" as used herein with respect to proteins means a 
protein preparation which is at least 75%, more preferably at least 90% and 
most preferably at least 99% by weight of SBDS protein. 

Preferred SBDS proteins have an amino acid sequence which is at 
least about 75%, preferably at least about 90% and more preferably at least 
about 99% identical to the amino acid sequence of SEQ ID NO:2. 
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In a preferred embodiment, the SBDS protein has the amino acid 
sequence of SEQ ID NO:2. Full length proteins and portions thereof 
corresponding to one or more domains thereof or comprising at least 5, 10, 
25, 50, 75 or 100 consecutive amino acids of SEQ ID NO:2 are within the 
scope of the invention. 

The proteins and peptides of the invention may be isolated and purified 
by conventional protein purification methods such as gel-filtration 
chromatography, ion exchange chromatography, high performance liquid 
chromatography, immunoprecipitation or immunoaffinity purification. 

SBDS proteins may be prepared by conventional recombinant 
methods, for example using the cDNAs described herein (for example human 
sequence has Genbank Accession Number AY1 69963) or a selected portion 
thereof. Since the SBDS gene is small, native gene expression may be 
achieved with the incorporation of natural promoter and enhancer gene 
elements. Suitable vectors and host cells for such expression are well known 
to those of skill in the art. 

The expressed protein can be purified by standard procedures, as 
described above. 

Antibodies 

The present invention also enables the preparation of antibodies or 
antibody fragments which bind specifically to SBDS protein or to a portion 
thereof. 

The term "antibody" means a monoclonal antibody or a polyclonal 
antibody, which binds specifically to a particular peptide, polypeptide or 
epitope, i.e. with greater affinity than to other peptides, polypeptides or 
eptiopes, and includes chimeric antibodies, humanised antibodies and single 
chain antibodies. 

Chimeric antibodies are antibodies which contain portions of antibodies 
from different species. For example, a chimeric antibody may have a human 
constant region and a variable region from another species. Chimeric 
antibodies may be produced by well known recombinant methods, as 
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described in U.S. Patents Nos. 5,354,847 and 5,500,362, and in the scientific 
literature (Couto et al. f (1993), Hybridoma, 12:485-489). 

Humanised antibodies are antibodies in which only the 
complementarity determining regions, which are responsible for antigen 
binding and specificity, are from a non-human source, while substantially all of 
the remainder of the antibody molecule is human. Humanised antibodies and 
their preparation are also well known in the art - see, for example, U.S. 
Patents Nos. 5,225,539; 5,585,089; 5,693,761 and 5,693,762. 

Single chain antibodies are polypeptide sequences that are capable of 
specifically binding a peptide or epitope, where the single chain antibody is 
derived from either the light or heavy chain of a monoclonal or polyclonal 
antibody. Single chain antibodies include polypeptides derived from 
humanised, chimeric or fully-human antibodies where the single chain 
antibody is derived from either the light or heavy chain thereof. 

The term "antibody fragmenf means a portion of an antibody that 
displays the specific binding of the parent antibody and includes Fab, F (ab'fe 
and F v fragments. 

Polyclonal Antibodies 

In order to prepare polyclonal antibodies, purified SBDS protein may be 
obtained, for example, as described herein. The purified protein or a portion 
thereof, coupled, if desired, to a carrier protein such as bovine serum albumin 
or keyhole limpet hemocyanin, as in Cruikshank WW, Center DM, Nisar N, 
Wu M, Natke B, Theodore AC, and Komfeld H., (1994), Proc. Natl. Acad. Sci. 
USA 24: 5109-5113, is mixed with Fruend's adjuvant and injected into rabbits 
or other suitable laboratory animals. 

Following booster injections at weekly intervals, the rabbits or other 
laboratory animals are then bled and the sera isolated. The sera can be used 
directly or purified prior to use by various methods including affinity 
chromatography employing Protein A-Sepharose, antigen Sepharose or Anti- 
mouse-lg-Sepharose. Further purification methods well known in the art may 
be utilised to remove viral and/or endotoxin contaminants. 
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Monoclonal Antibodies 

As will be understood by those skilled in the art, monoclonal antibodies 
may also be produced using an SBDS protein or a portion thereof. The . 
protein or portion thereof, coupled to a carrier protein if desired, is injected in 
Freund's adjuvant into mice. After being injected three times over a three- 
week period, the mice spleens are removed and resuspended in phosphate 
buffered saline (PBS). The spleen cells serve as a source of lymphocytes, 
some of which are producing antibody of the appropriate specificity. These 
are then fused with a permanently growing myeloma partner cell, and the 
products of the fusion are plated into a number of tissue culture wells in the 
presence of a selective agent such as HAT. The wells are then screened by 
ELISA to identify those containing cells making binding antibody. These are 
then plated and after a period of growth, these wells are again screened to 
identify antibody-producing cells. Several cloning procedures are carried out 
until over 90% of the wells contain single clones which are positive for 
antibody production. From this procedure a stable line of clones which 
produce the antibody is established. The monoclonal antibody can then be 
purified by affinity chromatography using Protein A Sepharose, ion-exchange 
chromatography, as well as variations and combinations of these techniques. 
Truncated versions of monoclonal antibodies may also be produced by 
recombinant techniques in which plasmids are generated which express the 
desired monoclonal antibody fragment in a suitable host. 

In a further embodiment, a cell line is provided which secretes an 
antibody specific for an SBDS protein or a portion thereof; a cell line secreting 
an antibody specific for a human SBDS protein is preferred. 

Diagnosis of Predisposition to A ML 

A number of SDS patients have been found to develop AML It is of 
some concern that individuals who have survived into adulthood without being 
diagnosed as SDS sufferers, because of minimal or unrecognised symptoms, 
may nevertheless also be at risk for the development of AML. The present 
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invention permits the identification of these individuals as SDS sufferers, so 
that they may be monitored for early signs of AML and appropriately treated. 
Although widespread screening of the population may not be practical, 
screening of relatives of diagnosed SDS patients for SDS-associated 
mutations is completely feasible, as also would be screening individuals 
exhibiting early or more overt signs of bone marrow transformation. 

In addition, SDS carriers, who have an SDS-associated mutation in 
only one allele of the SBDS gene and are therefore asymptomatic, may be at 
risk for AML if they should experience loss or mutation of the wild-type allele, 
particularly in haemotological tissues. Again, screening of family members in 
SDS-affected families will indicate such genetic changes. 

Kits 

The invention further provides kits for use in the diagnostic methods 
described above for determining whether a subject is suffering from or is at 
risk for SDS, for determining whether a subject is a carrier of SDS or for 
determining whether a subject is at risk for AML. Such kits can comprise, for 
example, one or more pairs of oligonucleotide primers suitable for 
amplification of the SBDS gene or portions thereof, such as primers suitable 
for amplification of particular exons of SBDS, particularly human SBDS, as 
described for example in Figure 6. such kits can also contain instructions for 
use of the primers, and optionally, additional reagents required for the 
diagnostic methods described herein. 

Therapeutic Methods 

The invention further provides methods and compositions for treating 
subjects, including humans, suffering from SDS. 

Methods of treatment are directed to restoring normal SBDS function in 
the subject. 
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Such methods include gene therapy to restore normal function at the 
gene level and administration of normal SBDS protein or portions thereof to 
make up for lack of normal gene expression. 

Gene therapy may, for example, involve administration to the subject of 
a construct comprising an expression vector containing a nucleotide 
sequence encoding a wild type SBDS protein. Suitable expression vectors 
include retroviral, adenoviral and vaccinia virus vectors. Administration may 
be intravenous, oral, subcutaneous, intramuscular or intraperitoneal. 

A large number of gene delivery methods are well known to those of 
skill in the art and may include, for example liposome-based gene delivery 
(Debs and Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) 
BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; Brigham (1991) 
WO 91/06309; and Feigner et al. (1987) Proc. Natl. Acad Sci. USA 84: 7413- 
7414), and replication-defective retroviral vectors harboring a therapeutic 
polynucleotide sequence as part of the retroviral genome (see, e.g., Miller et 
al. (1990) Mol. Cell. Biol. 10:4239 (1990); Kolberg (1992) J. NIH Res. 4:43, 
and Cometta et al. Hum. Gene Ther. 2:215 (1991)). Widely used retroviral 
vectors include those based upon murine leukemia virus (MuLV), gibbon ape 
leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno 
deficiency virus (HIV), and combinations thereof. See, e.g., Buchscher et al. 
(1992) J. Virol. 66(5) 2731-2739; Johann et al. (1992) J. Virol. 66 (5):1635- 
1640 (1992); Sommerfelt et al., (1990) Virol. 176:58-59; Wilson et al. (1989) J. 
Virol. 63:2374-2378; Miller et al.; J. Virol. 65:2220-2224 (1991); Wong-Staal et 
al., PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental 
Immunology, Third Edition Paul (ed) Raven Press, Ltd., New York and the 
references therein, and Yu et al., Gene Therapy (1994) supra). 

AAV-based vectors are also used to transduce cells with target nucleic 
acids, e.g., in the in vitro production of nucleic acids and peptides, and in in 
vivo and ex vivo gene therapy procedures. See, West et al. (1987) Virology 
160:38-47; Carter et al. (1989) U.S. Pat. No. 4,797,368; Carter et al. WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka 
(1994) J. Clin. Invest. 94:1351 and Samulski (supra) for an overview of AAV 
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vectors. Construction of recombinant AAV vectors are described in a number 
of publications, including Lebkowski, U.S. Pat No. 5,173,414; Tratschin et al. 
(1985) Mol. Cell. Biol. 5(1 1):3251 -3260; Tratschin, et al. (1984) Mol. Cell. Biol. 
4:2072-2081; Hermonat and Muzyczka (1984) Proc. Natl. Acad. Sci. USA 
81:6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989) J. Virol. 
63:03822-3828. Cell lines that can be transformed by rAAV include those 
described in Lebkowski et al. (1988) Mol. Cell. Biol. 8: 3988-3996. 

The organ with the most serious life threatening consequences, the 
bone marrow, may be treated by ex vivo gene therapy. This would involve the 
1 ) extraction of bone marrow cells, 2) introduction of cDNA without mutations 
in conjunction with expression guiding elements followed by 3) re-introduction 
of these modified cells back to the bone marrow. Similar strategies have 
been used successfully in other diseases including severe combined 
immunodeficiency -X1 (M Cavazzana-Calvo, S Halcein-Bey, G de Saint 
Basile, F Gross, E Yvon, P Nusbaum, F Selz, C Hue, S Certain, J-L 
Casanova, P Bousso, F Le Deist and A Fischer. (2000) Gene therapy of 
human severe combined immunodefiency (SCID)-X1 disease. Science 288: 
669-672; all of which are incorporated herein by reference). The SBDS gene 
is notably small such that native gene expression may be achieved with the 
incorporation of natural promoter and enhancer gene elements. 

The SBDS nucleotide sequences described herein may be used in 
conventional expression systems, as described herein, to permit production of 
depechin protein in amounts sufficient for antibody production or for therapy. 

Therapeutic compositions in accordance with the invention comprise 
an isolated nucleotide sequence encoding an SBDS protein or effective 
fragment thereof or a substantially purified SBDS protein or effective fragment 
thereof. 

Transgenic animal models of SDS 

The invention further enables the creation of an animal model of SDS 
which is important for further study of how SBDS mutations lead to the various 
SDS-associated disease manifestations and for testing of potential 



WO 2004/020658 



PCT/CA2003/001320 



25 

therapeutics. A number of non-human mammals may be used to create such 
a model, including without limitation mice, rats, rabbits, sheep, goats and non- 
human primates. An animal model of SDS may have within its genome one 
or both SBDS genes with at least one mutation which when expressed results 
in symptoms of SDS. Identification and sequencing of the mouse SBDS gene 
homologue, as described herein, facilitates the creation of such animal 
models, for example a mouse model. 

Methods for the creation of transgenic animals are well known to those 
of skill in the art. A transgenic animal according to the invention is an animal 
having cells that contain a transgene which was introduced into the animal or 
an ancestor of the animal at a prenatal (embryonic) stage. A transgenic 
animal can be created, for example, by introducing the gene of interest into 
the male pronucleus of a fertilised oocyte by, e.g., microinjection, and allowing 
the oocyte to develop in a pseudopregnant female foster animal. The gene of 
interest may include appropriate promoter sequences, as well as intronic 
sequences and polyadenylation signal sequences. Methods for producing 
transgenic animals are disclosed in, e.g., U.S. Pat. Nos. 4,736,866 and 
4,870,009 and Hogan et al., A Laboratory Manual, Cold Spring Harbor 
Laboratory, 1986. A transgenic founder animal can be used to breed 
additional animals carrying the transgene. A transgenic animal carrying one 
transgene can also be bred to another transgenic animal carrying a second 
transgene to create a "double transgenic" animal carrying two transgenes. 
Alternatively, two transgenes can be co-microinjected to produce a double 
transgenic animal. Animals carrying more than two transgenes are also 
possible. Furthermore, heterozygous transgenic animals, i.e., animals 
carrying one copy of a transgene, can be bred to a second animal 
heterozygous for the same transgene to produce homozygous animals 
carrying two copies of the transgene. For a review of techniques that can be 
used to generate and assess transgenic animals, skilled artisans can consult 
Gordon (Intl. Rev. Cytol., 115:171-229 (1989)), and may obtain additional 
guidance from, for example: Hogan et al, Manipulating the Mouse Embryo 
(Cold Spring Harbor Press, Cold Spring Harbor, N.Y. 1986); Krimpenfort et 
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al., Biotechnology, 9:844-847 (1991); Palmiter et al., Cell, 41:343-345 
(1985); Kraemer et al., Genetic Manipulation of the Early Mammalian Embryo 
(Cold Spring Harbor Press, Cold Spring Harbor, N.Y. 1985); Hammer et al., 
Nature, 315:680-683 (1985); Purscel et al., Science, 244:1281-1288 (1986); 
Wagner et al., U.S. Pat. No. 5,175,385; and Krimpenfort et al., U.S. Pat. No. 
5,175,384. 

EXAMPLES 

The examples are described for the purposes of illustration and are not 
intended to limit the scope of the invention. 

Methods of molecular biology, genetics, protein and peptide 
biochemistry and immunology referred to but not explicitly described in this 
disclosure and examples are reported in the scientific literature and are well 
known to those skilled in the art. 

Methods 

Human Subjects. Families with SDS included in this study have been 
described, and additional families have been obtained through ongoing . 
recruitment 2 . The criterion for inclusion in the study was the presence of both 
exocrine pancreatic dysfunction and haematologic abnormalities, including 
neutropenia and other problems associated with bone marrow failure. 
Consent was obtained from all participating families, and procedural approval 
was obtained from the human subjects review board of The Hospital for Sick 
Children, Toronto (HSC). Genomic DNA was extracted either from Epstein- 
Barr virus (EBV) transformed B-lymphoblastoid cell lines or directly from 
peripheral white blood cell pellets, as described by Miller et a/. 24 . Patient and 
control RNA was extracted from EBV-transformed B-lymphoblastoid cell lines 
as previously described 25 . DNA from 100 control Caucasian individuals 
(Human variation panel HD100CAU) was purchased from Coriell Cell 
Repositories (Camden, NJ). 
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Physical Mapping. Genomic sequences were identified through BLAST 
analysis of STSs and genetic markers in the SDS critical interval against the 
GenBank non-redundant (nr) and high throughput genome sequence (htgs) 
databases 26 . Where the density of pre-existing markers was low, BAC and 
YAC clones assigned to the region were subcloned and sequenced to provide 
new STSs as described 5 . Genomic sequences were compiled manually and 
the framework was supported by radiation hybrid mapping of select STSs. 

Candidate Gene Identification. Candidate genes were identified in genomic 
sequences through the use of annotation data provided by GenBank 
(http://www.ncbi.nlm.nih.gov) and Project Ensembl 
(http://www.ensembl.org) 26,27 . Ab initio gene predictions were obtained 
through the use of GeneScript . Human genomic sequences were also 
compared to mouse genomic sequences (available through Cetera Discovery 
System and Celera Genomics' associated databases) from the syntenic 
interval on mouse chromosome 5 using PipMaker2 to identify regions of 
cross-species conservation 28 . All in silico gene predictions were confirmed by 
RT-PCR analysis using random-primed cDNA derived from fetal brain, and/or 
testes poly(A)+ mRNA (Clontech, Palo Alto, CA). 

Mutation Detection. The genomic structure of the SBDS gene and its 
pseudogene copy were used to design primer pairs using Primer3 to screen 
coding regions 29 . .The position of primer pairs is shown (Figs. 1 and 6). PCR 
products were directly sequenced or cloned using a Topo TA-cloning kit 
(Clontech) prior to sequencing. Primer pairs (specific for SBDS unless 
otherwise stated) used were: A (5'-G CGTAAAAAG CCACAATAC-3' ) and B 
(5'-CTATGACAGTATTCGTAAGACTAGG-3') (exon 1), C (5- 
GGGGATTTGTTGTGTCTTG-3') and D (5'-CTTTCCTCCAGAAAAACAGC-3') 
(exon 2, SBDS/SBDSP dual-specific), E (5-AAATGGTAAGGCAAATACGG- 
3') and F (5'-ACCMGTTCmA™TTAGAAGTGAC-3') (exon 2), G (5*- 
GCTCAMCCATTACTTACATATTGA-3') and H (5- 
CACTTGCTTCCATGCAGA-3') (exon 3), I (5'- 
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AAAGGGTCATTTTAACACTTC-3') and J (5- 
GAAAATATCTGACGTTTACAACA-3') (exon 4), K (5'- 
TCCACTGTAGATGTGAACTAACTC-3') and L (5- 
CACTCTGGACTTTGCATCTT-3') (exon 5), M (5- 

GCTTCTGCTCCACCTGAC-3') and N (5'AGCTATGCTGCAGCTGTTAC-3') 
(exons 1 & 2, SBDS/SBDSP dual-specific), O (5- 

ATGCATGTCCAAGTTTCAAG-3') and P (5*-TCCATGGCTATATTTTGATGA- 
3') (exons 2 & 3, SBDS/SBDSP dual-specific). Patients were also screened 
for mutations through sequencing of RT-PCR products from random-primed 
cDNA derived from patient EBV-transformed B-lymphoblastoid cell lines. 
Primers used were: Q (5'-TAAGCCTGCCAGACACAC-3') and R (5'- 
CACTCTGGACTTTGCATCTT-3') (yields full length SBDS open reading 
frame), Q and S (5'-TGTTGGTTTTCACCGAATA-3'), and T (5- 
AGATAAAGAAAGACACACACAACT-3') and R. Gene conversion mutations 
were detected through restriction analysis of exon 2 PCR fragments. Exon 2 
was amplified from patient DNA using PCR primers C & D or E & F, and 
purified using a MinElute PCR Cleanup Kit (Qiagen). Restriction digestion 
using Ddel (not shown) or Bsu036\ {[183TA>CTI) and CacSI {[258+2T>C]) 
(New England Biolabs, Beverly, MA) was carried out as recommended by the 
manufacturer and analyzed by agarose gel electrophoresis. For all mutations, 
allele-specific oligonucleotide hybridisation to amplified SBDS exons from 
control individuals was carried out as described 30 . 

Southern Hybridisation. Genomic DNA from patients and control individuals 
was subjected to restriction digestion with A/del (New England Biolabs) as 
recommended by the manufacturer and products were separated by agarose 
gel electrophoresis. The DNA was blotted and hybridised with a radiolabeled 
SBDS partial cDNA probe (exons 1-3) as described 30 . 

RT-PCR and RNA Blot Analysis. A panel of cDNAs derived from 22 adult 
and fetal tissues (Clontech) were analyzed by RT-PCR according the 
supplier's recommendations. Primers used were T and R (SBDS), and (5- 
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TAAGTAAGCCTGCCAGACA-3') and (5'-CATCAAGGTC I 1 1 1 ICCAAG-3') 
(SBDSP). Primers used to assess the relative amount of SBDS exon 2 
alternative splicing were U (5-GAAATCGCCTGCTACAAA-3') and V (5- 
TCAGCTTCTTGCCTTCAT-3'). RNA blots of poly(A)+ mRNA (Clontech) were 
hybridized to DNA probes labeled with [a^PJ-dCTP 30 . The SBDS probe was 
a cloned RT-PCR fragment (primers Q and R). The intron 1 probe was PCR 
amplified from genomic DNA using primers (5'-CCTGTCTCTGCCCAAGTC- 
3') and (5'-AGGGAACATTTTCAAAACTCA-3'). 

Sequence Alignment and Analysis. SBDS orthologues were identified 
through BLASTP analysis of amino acid sequences in the GenBank nr 
database, and through TBLASTN analysis of the GenBank EST database 
(dbEST). Sequences were aligned with CLUSTALX using default parameters 
followed by manual adjustment 31 . Amino acids were analysed for the 
presence of functional motifs using Pfam and associated databases 
(http://www.sanger.ac.uk/Software/Pfam/) 21 . 

Genbank Accession Numbers. SBDS consensus cDNA, AY1 69963 cDNA 
flj10917, AK001779; SDCR2A (cDNA flj10900), AK001762; SDCR3 (cDNA 
flj10099), AK000961; BAC RP11-458F8, AC073335; BAC RP11-325K1, 
AC079920; BAC RP11-584N20, AC069291; BAC RP11-324F21, AC073089; 
BAC RP1 1-1 6604, AC006480; BAC RP11-479C13, AC005236. Depechin 
orthologues: Arabidopsis thaliana At1g43860 gene product, NP_564488; 
Drosophila melanogaster CG8549 gene product, NP_648057; Caenorhabditis 
elegans protein W06E11 .4.p, NP_497226; Mus musculus protein 22A3, 
P70122; Oryzias latipes amino acid sequence derived from cDNA clone 
MF01SSA157A09 5' and 3' overlapping sequence reads, BJ013200 and 
BJ025159; Saccharomyces cerevisiae Ylr022cp, NP_013122; 
Encephalitozoon cuniculi ECU08J610 gene product, NP_597289; 
Methanosarcina acetivorans sfr. C2A MA1778 gene product, NP_616704; 
Halobacterium sp. NRC-1 Vng1276c, NP_280149; Methanopyrus kandleri sfr. 
AV19 MK0384 gene product, NP_613669; Methanococcus jannaschii MJ0592 
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gene product, NP_247572; Archaeoglobus fulgidus AF0491 gene product, 
NPJJ69327; Pyrococcus abyssi PAB0418 gene product NPJ26299; 
Thermoplasma acidophilum Ta1291m gene product, NP_394745; 
Pyrobaculum aerophilum PAE2209 gene product, NP_559847; Sulfolobus 
solfataricus SSO0737 gene product, NP_342243; Aeropyrum pernixAPE1167 
gene product, NP_1 47753; Populus balsamifera subsp. Trichocarpa amino 
acid sequence derived from cDNA clone F038P45Y, BI121507; Gossypium 

arboreum amino acid sequence derived from cDNA clone GA Ed0050B07f, 

BQ402534. 

Example 1 

RT-PCR analysis of several SDS patients with SeDS-specific 
oligonucleotide primers (indicated as RT-PCR primers Q and R in Fig. 1a and 
described in Fig. 6) revealed recurring sequence changes in exon 2, including 
a TA>CT dinucleotide change at position 183 or an 8 bp deletion at the end of 
the exon (the nucleotide numbering is described in Figs. 5 and 6). Analysis of 
SBDS genomic sequences confirmed the presence of the [183TA>CT] 
sequence change and revealed a [258+2T>C] nucleotide change in patients 
expressing the deleted SBDS transcript. [258+2T>C] is predicted to disrupt 
the donor splice site of intron 2, and the 8 bp deletion observed in the 
transcript is consistent with use of an upstream cryptic splice donor site at 
position 251 . Alignment of patient SBDS sequences to genomic sequences 
from GenBank and control individuals indicated that both changes 
corresponded to sequences normally present in SBDSP (Fig. 2a, b). The 
dinucleotide alteration [183TA>CT] introduces an in-frame stop codon (K62X) 
while [258+2T>C] and its resultant 8 bp deletion also causes premature 
truncation of the encoded protein by frameshift (84Cfs3). Patient alleles were 
also identified that contain both of these changes together with an additional 
silent nucleotide change ([201 A>G]) in the intervening segment, again 
consistent with the pseudogene sequence (Fig. 26). The [183TA>CT] and 
[258+2T>CJ changes could be detected in amplified SBDS genomic DNA 
followed by restriction digestion with Bsu36\ and Cac8l, respectively (Fig. 2a, 
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c). Analysis of SDS pedigrees revealed that these changes were inherited 
and disease-associated. An example of segregating alleles in a linked 
pedigree is shown in Fig 2c. The specificity of genomic DNA amplimers for 
SBDS was supported by the absence of additional pseudogene-like sequence 
changes in nucleotide positions flanking the 183 and 258+2 bp positions (Fig. 
2b) and the absence of any SBDSP-like sequences in 100 control samples. 
These findings, together with the observation of unaltered hybridisation 
patterns of genomic DNA with a SBDS probe (Fig. 2d), indicated that gene 
conversion due to recombination between SBDS and its highly homologous 
pseudogene had occurred. A similar basis for mutation has been observed in 
other genetic diseases 7 ' 19 . Sequence analysis of the exon 2 region of patients 
indicated that most conversion events are confined to a short segment 
between 141 bp and 258+124 bp with a maximum size of 240 bp (Fig. 2a, b). 
Based on restriction digestion or sequencing of PCR products of patients from 
158 unrelated families, 74% of SDS alleles (n=235 of 31 6) are the result of 
gene conversion, with 89% of patients carrying at least one converted allele 
and 60% carrying two converted alleles. Consistent with being a recessive 
disease, patients carry mutations on both copies of the SBDS gene. Of the 
patients analysed in the initial study, 50% were [183TA>CT] + [258+2T>C] 
compound heterozygotes, 5.1% were [183TA>CT + 258+2T>C] + 
[258+2T>C] compound heterozygotes, and 4.4% were homozygous for a 
[258+2T>C] conversion. Of patient alleles not displaying the conversion 
mutations, genomic sequencing revealed other changes within the coding 
region of SBDS, including small deletions, insertions, and nucleotide 
substitutions that would lead to frameshift and premature truncation, missense 
and nonsense changes (Table 1 and Fig. 4). To date, these mutations were 
not detected in 100 Caucasian control DNA samples by allele specific 
oligonucleotide hybridization or correspond to changes of highly conserved 
amino acids that would not be expected to be important for protein structure 
or function. Table 1 shows the SDS-associated mutations identified in the 
initial study and in subsequent studies. 
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Example 2 

RNA hybridisation with SBDS indicated broad expression of a 1.6 kb 
message (Fig. 3a). Numerous GenBank EST clones, however, indicated that 
the pseudogene is also transcribed. Prominent larger-sized transcripts were 
also observed in poly(A)+ mRNA from several tissues and were confirmed to 
include intron 1 through hybridisation of an intron 1 -specific probe (Fig. 3a). 
In addition, three GenBank EST clones corresponding to SBDSP were found 
to contain intron 1. 

RNA expression analysis was carried out on a number of normal 
adult or fetal tissues, and on lymphoblasts from a number of SDS patients. 
As seen from Figure 3b, the level of combined SBDS/SBDSP mRNA, and 
consequently of protein product, was notably reduced in patient samples, 
compared with control C, lymphoblast RNA from a healthy subject. 

Distinction between expression of the gene and pseudogene could 
be obtained through RT-PCR with specific oligonucleotide primers (Fig. 3c). 
Further, a broad survey of tissues revealed that the majority of SBDS mRNA 
does contain exon 2 although its alternative splicing was prominent in some 
patients (Fig. 3c and data not shown). Both RT-PCR and RNA analyses 
supported widespread expression of SBDS in all tissues examined, including 
the most disease-relevant tissues, pancreas, bone marrow, and myeloid 
lineages (Fig. 3a, c). 

Example 3 

Generation of antibodies for SBDS protein detection 

Two methods were used to generate specific antibody probes to detect 
SBDS protein cells and tissues. First, a bacterially expressed polypeptide 
with the entire open reading frame of SBDS and, second, specified peptides 
synthesised from the amino and carboxyl portion (see legend to Fig. 7), were 
used as immunogens in rabbits. To obtain high level expression of 
recombinant SBDS, the complete open reading frame of the SBDS gene was 
incorporated into the pET28a vector (Novagen) using standard molecular 
biology techniques (Ref. 30). The open reading frame was fused with the 
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(HIS)6 tag of the expression vector for purification with immobilised metal 
(Ni2+) affinity chromatography. The purified polypeptide was then conjugated 
and injected into rabbits with the services of Washington Biotechnology, Inc. 
Pre-immune and immune sera were collected and whole cell protein extracts 
of various cell types were assessed, Fig. 7. The amino and carboxyl peptide 
antibodies were synthesised and prepared with the services of AnaSpec, Inc. 
and Washington Biotechnology, Inc., respectively. The antibodies showed 
high affinity and specificity for the SBDS protein product in different organs 
and cell lines, by Western blotting carried out as follows. 

Whole cell extracts were prepared with Laemmli (E coli) or RIPA 
(mammalian cells) buffer (and separated by 13.5% PAGE prior to blotting on 
Hybond C Extra (Amersham) membrane (Ref. 30 and Harlow and Lane). For 
rSBDS and anti-CpSBDS anti-sera, the membrane was blocked with 7% skim 
milk in TBST (10mM TrisHCl, pH7.3, 100mM NaCI with 0.1% Tween 20) for 
overnight at room temperature followed by incubation of a 1 :2000 dilution for 5 
h at room temperature. The blot was washed with TBST for five consecutive 
washes and incubated with anti-rabbit secondary antibody (Stressgen 
Biotechnologies Corp). The anti-Myc (Oncogene Research Products) and 
anti-HA (BAbCO-Covance) monoclonal antibodies and the anti-mouse 
secondary antibodies (Jackson ImmunoResearch Labs, Inc.) were used as 
recommended by their suppliers. The immunoreactive bands were detected 
by enhanced chemiiuminescence. 
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Table 1: SDS-associated mutations 



Nucleotide Sentience Chanaes 

llUwIvvUUC WC\|UCIIVC viioiljjv^ 


Predicted Amino Acid 




Chanae 


183 184TA— *CT 


K62X 


hqo 1 84TA— ►HT+258+2T— >C 


K62X 


OCO + OT v,p 

zoo*^ I — *o 




O/p v A 
Z*fO — >r\ 


NI8K 


Qfi Q7incA 






S41fs17 

0*T 1191 1 


lo 1A — 


F44C5 


1QQA ^ 

i y yA — >o 


KR7F 


400* 1 V3 — r\s 


84Cfc3 

UtwIOJ 


ZDU 1 — 


187^ 

lOf o 


9Q1 -9Q^HalTA AinQAf^TTP AAf^TATH 


DQ7-K98delinsE\/Q\/S 

L/C/ i will IOI — V Vx V w 


Of /V3 — H-r 


R19RT 


505C—T 


R169C 


56G->A 


R19Q 


93C->G 


C31W 


97A-+G 


K33E 


101A-*T 


N34I 


123delC y 


S41fs17 


279 284delTCAACT 


Q94 V95del 


296 299delAAGA 


E99fs20 


354A->C 


K118N 


428C-+T+443A-+G 


S143L + K148R 


458A^G 


Q153R 


460-1 G-^A 


splice 


506G-»C^ 


R169P 


624+1 G-»C 


splice 
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Table 2: SBDS Polymorphisms 

Some sequence changes in SBDS are predicted to be silent polymorphisms. 
Although some of these changes were detected in SDS patients, allele- 
specific oligonucleotide hybridisation was used to screen control samples to 
determine that these changes are not disease associated and should be 
classified as silent polymorphisms. 



Nucleotide Sequence 
Change , 



Predicted Amino Acid 
Change 



Intron 1 
1 29-71 G-*A 
129-185G-*A 
129-225C-+G 
129-265G-»A 

Intron 2 
258+1 9A->G 
258+54T— G 
258+99A— C 

Intron 3 
459+92A->G 



Exon 2 

141C->T 

201A-*G 



L47L 
K67K 



Exon 5 

651C->T 

635T->C 



F217F 
I212T 



Rare Change 
210T— C 



D70E 
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The common mutations that account for the majority of SDS alleles can be detected by a 
PCR-restriction enzyme digestions of Bsu36l and CacSl. These digestions can be 
performed singly (as described on page 27, methods) or in combination as detailed below. 
The combination digestion permits distinction between the conversions encompassing 

PCR Amp lification fsame as for single digests already described^ 

Primer E (SEQ ID NO: 7): 5' -AAATGGTAAGGCAAATACGG- 3' 

Primer F (SEQ ID NO: 8): 5' -ACCAAGTTCTTTATTATTAGAAGTGAC- 3' 

product size: 733 bp 
annealing temperature: 56.6 C 
extension time: 40 sec 

Double Digestion 

Bsu36 1 (New England Biolabs #R0524): 6 units pius 
CacSl (New England Biolabs #R0579): 4.8 units 
per 100-200ng PCR product 
Digest 37 C>3hr 

Band Sizes detected o n agarose gel with ethidium bromide intercalation 

Normal: 584bp also 64bp, 41bp, and smaller bands that are difficult to see 
258+2 T>C: 431bp and 153bp also 64bp, 41bp and smaller bands 
183 TA>CT: 358bp and 226bp also 64 bp, 41 bp and smaller bands 
258+2T>C + 183TA>CT: 358bp, 153bp, 73bp also 64bp, 41bp and smaller bands 

Note: Cannot use Ddel for this double digest, npfst use Bsu36I and Cac8I. 

Mouse and human gene 
88% nucleotide identity 
97% amino acid identity 
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Daal Specific Digests for Common Mutations 

PCR Amplification 

Forward Primer 5' ^jGGGATTTGTTGTGTCTT- 3 ' 
Reverse Primer: 5' - CTTTCCTCCAGAAAAACAGC - 3' 

product size: 336 bp 
annealling temperature: 56 °C 
extension time: 1 rain 

Cac8 I Digest 

Cac8I (NEB #R0579): 4.8 units 
Digest 37 "C >3br 

Band Size 

Normal: 2X336 bp, 2X241 bp, 2X95 bp- 

1 allele with 258+2 T>C: 1 X 336 bp, 3 X241 bp, 3 X95 bp 

2 alleles with 25 8+2 T>C: 4 X 24 1 bp, 4 X 95 bp 

Dde I Digest 

Dde I (NEB #R0175): 6 units 
Digest 37 °C 2hr 

Band Size 

Normal: 2 X 190bp,2 X 169 bp,4X 146, 2 X21 bp 

1 allele with 183 TA>CT: 1 X 190 bp, 3 X 169 bp, 4 X 146, 2 X 21 bp 

2 alleles with 183 TAXJT: 4 X 169 bp, 4 X 146, 2 X 21 bp 
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We Claim: 

1 . A method for determining whether a subject is suffering from 
Schwachman-Diamond Syndrome (SDS) comprising 

obtaining a nucleic acid sample from the subject, and 
conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SBDS gene mutation associated with SDS in both SBDS 
alleles indicates that the subject suffers from SDS. 

2. The method of claim 1 wherein the assay is selected from the group 
consisting of probe hybridisation, direct sequencing, restriction enzyme 
fragment analysis and fragment electrophoretic mobility. 

3. The method of claim 2 wherein the nucleic acid sample is a DNA 
sample or an RNA sample and the assay is a direct sequencing assay. 

4. The method of claim 3 wherein the nucleic acid sample is a genomic 
DNA sample and the assay comprises the steps of: 

(a) amplifying a target portion of the nucleotide sequence of the 
genomic DNA; 

(b) obtaining the nucleotide sequence of said amplified target 
portion; and 

(c) determining the presence or absence of a SBDS gene mutation 
associated with SDS in said target portion of the nucleotide 
sequence. 

5. The method of claim 3 wherein the nucleic acid sample is an RNA 
sample and the assay comprises the steps of: 

(a) reverse transcribing the RNA sample to produce a 
corresponding cDNA; 
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(b) performing at least one polymerase chain reaction with suitable 
oligonucleotide primers to amplify the SBDS cDNA; 

(c) obtaining the nucleotide sequence of the amplified SSDS cDNA; 
and 

(d) determining the presence or absence of a SBDS gene mutation 
associated with SDS in said nucleotide sequence. 



6. The method of claim 4 or 5 wherein the presence or absence of a 
mutation selected from the group consisting of 24C>A; 97_97insA; 1 19delG; 
131A>G; 183TA>CT; 183TA>CT + 201A>G+258+2T>C; 199A>G; 258+2T>C; 
258+1 G>C; 260T>G; 291_293delTAAinsAGTTCAAGTATC; 377G>C; 
505OT+651OT, 183J84TA CT; 183J84TA CT+258+2T C; 258+2T C; 
24C A; 96-97insA; 1 19delG; 131A G; 199A G; 258+1G C; 260T G; 291- 
293delTAAinsAGTTCAAGTATC; 377G C; 505C T; 56G A; 93C G; 97A G; 
101AT; 123delC; 279_284delTCAACT; 296_299delAAGA; 354A C; 428C 
T+443A G; 458A G; 460-1 G A; 506G C; and 624+1 G C is determined. 

7. The method of claim 4 or 6 wherein the target portion of the nucleotide 
sequence is amplified using a primer pair selected from the group consisting 
of: 

(a) Primer A and Primer B; 

(b) Primer E and Primer F; 

(c) Primer G and Primer H; 

(d) Primer SDCR9x4seqB and Primer J; 

(e) Primer SDCR9x5Fseq and Primer L; 

(f) Primer Q arid Primer B; 

(g) Primer I and Primer J; 

(h) Primer K and Primer L; and 

(i) Primer SDCR9prom1 RA and Primer SDCR9prom6FA. 

8. The method of claim 2 wherein the nucleic acid sample is a DNA 
sample and the assay is a restriction enzyme fragment analysis. 
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9. The method of claim 8 wherein the assay comprises the steps of: 

(a) digesting the DNA with a restriction enzyme to give restriction 
fragments; 

(b) separating the restriction fragments by agarose gel 
electrophoresis; and 

(c) detecting the separated fragments by hybridisation of the 
fragments to a detectably labelled nucleotide probe specific for 
SBDS. 

i 

1 0. The method of claim 9 wherein the restriction enzyme is at least one of 
Cac81 and Bsu361. 

11. The method of any one of claims 1 to 1 0 wherein the subject is a 
human subject. 

12. A method for determining whether a subject is an SDS carrier 
comprising 

obtaining a nucleic acid sample from the subject, and 
conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SBDS gene mutation associated with SDS in one SBDS 
allele indicates that the subject is an SDS carrier. 

1 3. The method of claim 1 2 wherein the assay is selected from the group 
consisting of probe hybridisation, direct sequencing, restriction enzyme 
fragment analysis and fragment electrophoretic mobility. 

14. The method of claim 13 wherein the nucleic acid sample is a DNA 
sample or an RNA sample and the assay is a direct sequencing assay. 
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1 5. The method of claim 1 4 wherein the nucleic acid sample is a genomic 
DNA sample and the assay comprises the steps of. 

(a) amplifying a target portion of the nucleotide sequence of the 

genomic DNA; 

(b) obtaining the nucleotide sequence of said amplified target 
portion; and 

(c) determining the presence or absence of a SBDS gene mutation 
associated with SDS in said target portion of the nucleotide 
sequence. 

1 6. The method of claim 1 4 wherein the nucleic acid sample is an RNA 
sample and the assay comprises the steps of: 

(a) reverse transcribing the RNA sample to produce a 
corresponding cDNA; 

(b) performing at least one polymerase chain reaction with suitable 
oligonucleotide primers to amplify the SBDS cDNA; 

(c) obtaining the nucleotide sequence of the amplified SBDS cDNA; 
and; 

(d) determining the presence or absence of a SBDS gene mutation 
associated with SDS in said nucleotide sequence. 

1 7. The method of claim 1 5 or 1 6 wherein the presence or absence of a 
mutation selected from the group consisting of 24C>A; 97_97insA; 119delG; 
131A>G; 183TA>CT; 183TA>CT + 201A>G+258+2T>C; 199A>G; 258=2T>C; 
258+1 G>C; 260T>G; 291_293delTAAinsAGTTCAAGTATC; 377G>C; 
505OT+651OT; 183J84TA CT; 183JI84TA CT+258+2T C; 258+2T C; 
24C A; 96-97insA; 119delG; 131 A G; 199A G; 258+1G C; 260T G; 291- 
293delTAAinsAGTTCAAGTATC; 377G C; 505C T; 56G A; 93C G; 97A G; 
101 A T; 123delC; 279_284delTCAACT; 296_299delAAGA; 354A C; 428C 
T+443A G; 458A G; 460-1 G A; 506G C; and 624+1 G C is determined. 
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1 8. The method of claim 1 5 or 1 6 wherein the target portion of the 
nucleotide sequence is amplified using a primer pair selected from the group 



consisting 


of: 


(a) 


Pnmer A and Pnmer b, 


(b) 


Primer E and Primer F; 


(?) 


Primer G and Primer H; 


(d) 


Primer SDCR9x4seqB and Primer J; 


(e) 


Primer SDCR9x5Fseq and Primer L; 


(f) 


Primer Q and Primer B; 


(g) 


Primer land Primer J; 


(h) 


Primer K and Primer L; and 


0) 


Primer SDCR9prom1 RA and Primer SDCR9prom6FA. 



1 9. The method of claim 1 3 wherein the nucleic acid sample is a DNA 
sample and the assay is a restriction enzyme fragment analysis. 

20. The method of claim 1 9 wherein the assay comprises the steps of: 

(a) digesting the DNA with a restriction enzyme to give restriction 
fragments; 

(b) separating the restriction fragments by agarose gel 
electrophoresis; and 

(c) detecting the separated fragments by hybridisation of the 
fragments to a detectably labelled nucleotide probe specific for 
SBDS. 

21 . The method of claim 20 wherein the restriction enzyme is Nde 1 . 

22. The method of any one of claims 1 2 to 21 wherein the subject is a 
human subject. 



23. A method for determining whether a subject is suffering from 
Shwachman-Diamond Syndrome (SDS) comprising 
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obtaining a tissue sample from the subject, and 

conducting an assay on the tissue sample to determine the level of 
SBDS protein in the sample, wherein a reduced leve^of SBDS protein in the 
sample relative to a control sample indicates that the subject suffers from 
SDS. 

24. The method of claim 23 wherein the tissue sample is selected from the 
group consisting of blood, buccal smear or bone marrow aspirate. 

25. A method for determining whether a subject is at risk for developing 
acute myelogenous leukaemia (AML) comprising 

obtaining a nucleic acid sample from the subject, and 
conducting an assay on the nucleic acid sample to determine the 
presence or absence of a SBDS gene mutation associated with SDS, wherein 
the presence of a SBDS gene mutation associated with SDS indicates that 
the subject is at risk for development of AML. 

26. A method for treating a subject suffering from SDS comprising 
administering to the subject a therapeutically effective amount of a 
substantially purified SBDS protein or of an isolated nucleotide sequence 
encoding an SBDS protein. 

27. The method of claim 26 wherein a sample of bone marrow cells is 
obtained from the subject and the bone marrow cells are transfected with a 
nucleotide sequence encoding an SBDS protein and re-introduced into the 
subject. 

28. The method of claim 26 or 27 wherein the nucleotide sequence 
encodes a protein of amino acid sequence SEQ ID NO:2. 

29. The method of claim 26 or 27 wherein the nucleotide sequence is the 
sequence of SEQ ID NO:1 . 
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30. The method of claim 26 wherein the substantially purified SBDS 
protein has the amino acid sequence of SEQ ID NO:2. 

31 . An isolated nucleic acid molecule encoding an SBDS protein. 

32. The nucleic acid molecule of claim 31 wherein the protein is a human 
SBDS protein. 

33. The nucleic acid molecule of claim 32 comprising a nucleotide 
sequence selected from the group consisting of: 

(a) the nucleotide sequence of SEQ ID NO:1; 

(b) a nucleotide sequence encoding the amino acid sequence of 
SEQ ID NO:2; 

(c) a nucleotide sequence which is a complement of a nucleotide 
sequence of (a) or (b); and 

(d) a nucleotide sequence which hybridises under stringent 
conditions to a nucleotide sequence of (a) or (b). 

34. The nucleic acid molecule of claim 31 wherein the protein is a murine 
SBDS protein. 

35. The nucleic acid molecule of claim 34 comprising a nucleotide 
sequence which encodes the amino acid sequence of SEQ ID NO:29 or 
comprising the nucleotide sequence of SEG ID NO:29. 

36. The nucleic acid molecule of claim 34 wherein the nucleotide 
sequence has at least one mutation selected from the group consisting of 
24C>A; 97_97insA; 119delG; 131A>G; 183TA>CT; 183TA>CT + 
201A>G+258+2T>C; 199A>G; 258+2T>C; 258+1 G>C; 260T>G; 
291_293delTMinsAGTTCAAGTATC; 377G>C; 505OT+651OT, 
183JI84TA CT; 183J84TA CT+258+2T C; 258+2T C; 24C A; 96-97insA; 
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119delG; 131A G; 199A G; 258+1G C; 260T G; 291- 
293delTAAinsAGTTCAAGTATC; 377G C; 505C T; 56G A; 93C G; 97A G; 
101 AT; 123delC; 279_284delTCAACT; 296_299delAAGA; 354A C; 428C 
T+443A G; 458A G; 460-1 G A; 506G C; and 624+1 G C is determined. 

37. The nucleic acid molecule of any one of claims 31 to 35 wherein the 
molecule is a DNA molecule. 

38. The nucleic acid molecule of any one of claims 31 to 35 wherein the 
molecule is an RNA molecule. 

39. A recombinant vector comprising the isolated nucleic acid molecule of 
any one of claims 31 to 38. 

40. A host cell comprising the vector of claim 39. 

41 . An isolated nucleic acid molecule comprising at least about 10, 20, 30, 
50, 75 or 100 consecutive nucleotides of SEQ ID NO:1 or 29. 

42. A substantially purified SBDS protein. 

43. The protein of claim 42 comprising an amino acid sequence selected 
from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:29. 

44. An antibody which binds specifically to an epitope of an SDS protein. 

45. The antibody of claim 44 wherein the antibody binds specifically to an 
SBDS protein having at least 89% amino acid identity with a protein 
comprising the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:29. 
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46. A hybridoma cell line which produces an antibody in accordance with 
claim 44 or 45. 

47. A method for preparing an SBDS protein comprising expressing the 
nucleotide sequence of any one of claims 31 to 38 in a suitable expression 
system and collecting the expressed protein. 

48. A nucleotide sequence selected from the group consisting of: 

(a) 5'-GCGTAAAAAGCCACAATAC-3' (SEQ ID NO:3); 

(b) 5'-CTATGACAGTATTCGTAAGACTAGG-3' (SEQ ID NO:4); 

(c) 5'-GGGGATTTGTTGTGTCTTG-3' (SEQ ID NO:5); 

(d) 5'-CTTTCCTCCAGAAAAACAGC-3' (SEQ ID NO:6); 

(e) 5-AAATGGTAAGGCAAATACGG-3' (SEQ ID NO:7); 

(f) 5'-ACCAAGTTCTTTATTATTAGAAGTGAC-3' (SEQ ID NO:8); 

(g) 5'-GCTCAAACCATTACTTACATATTGA-3' (SEQ ID NO:9); 

(h) 5'-CACTTGCTTCCATGCAGA-3' (SEQ ID NO: 1 0); 

(i) 5*-AAAGGGTCATTTTAACACTTG-3' (SEQ ID NO:1 1 ); 

(j) 5'-GAAAATATCTGACGTTTACAACA-3' (SEQ ID NO:12); 

(k) 5-TCCACTGTAGATGTGAACTAACTC-3' (SEQ ID NO:1 3); 

(I) 5'-CACTCTGGACTTTGCATCTT-3' (SEQ ID NO:14); 

(m) 5'-GCTTCTGCTCCACCTGAC-3' (SEQ ID NO:1 5); 

(n) 5'AGCTATGCTGCAGCTGTTAC-3' (SEQ ID NO:1 6); 

(o) 5'-ATGCATGTCCAAGTTTCAAG-3' (SEQ ID NO:17); 

(p) 5'-TCCATGGCTATATTTTGATGA-3' (SEQ ID NO:1 8); 

(q) S-TAAGCCTGCCAGACACAC-S' (SEQ ID NO:19); 

' (r) 5'-CACTCTGGACTTTGCATCTT-3' (SEQ ID NO:20); 

(s) 5'-TGTTGGTTTTCACCGAATA-3' (SEQ ID NO:21 ); 

(t) 5-AGATAAAGAAAGACACACACAACT-3' (SEQ ID NO:22); 

(u) 5'-GAAATCGCCTGCTACAAA-3' (SEQ ID NO:23); 

(v) 5'-TCAGCTTCTTGCCTTCAT-3' (SEQ ID NO:24); 

(w) 5'-TAAGTAAGCCTGCCAGACA-3' (SEQ ID NO:25); 

(x) 5'-CATCAAGGTCTTTTTCCAAG-3' (SEQ ID NO:26); 
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(y) S'-CCTGTCTCTGCCCAAGTC-S' (SEQ ID NO:27); and 
(z) S'-AGGGAACATTTTCAAAACTCA-S" (SEQ ID NO:28). 

49. A transgenic non-human mammal having within its genome an SBDS 
gene with at least one mutation associated with SDS. 

50. The mammal of claim 49 wherein the mammal is selected from the 
group consisting of mice, rats, rabbits, sheep, goats and non-human 
primates. 

51 . The mammal of claim 49 wherein the mammal is a mouse. 

52. A kit comprising at least one pair of primers suitable for amplification of 
at least a portion of an SBDS gene. 
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N8K N34fs15 S41fe15| K62X K67E 84Cfs3| 

Ath MSKTLVQPVGQKRLTNVAWRLKKQGNRFBIACYKNKVLSHRSGV-EKDIDEVLQSHTVYSNVSKGVLAKSKDLMKSFGSDDHTKICIDI 

Dme MSK-IFTPTNQIRLTNVAIVRLKKGGKRFBIACYKNKVLSHRSNS-EKDIDEVLQTHTVFTNVSKGQAAKKDELQKAFNKTDErEICKEI 

Cel MSKNIKTPTNQKVLTNVAVVRMKKTGKRFEIACYKNKVVNHRNKS-EKDIDEVLQTHTVFSNVSKGQLSKKEELIAAFGIEDQLEICKII 

Mrau MS"~IFTPTNQIRLTNVAVVRMKRGGKRFBIACYK1?R^VGWRSGV-EKDLDEVLQTHSVFVNVSKGQVAKKEDLISAFGTDDQTBICKQI 

Hsa MS— IFTPTiifelP^l'NVA V VRMKJlAGKRgBI^^ 

Ola MS— IFTPTHQIRLTNVAWRMKKGGKRFBIACYKHKVMSHRTGA-EKDLDEVLQTPSVFINVSKGQTAKKDDLLKAFGTEDQTEICKQI 

See MP— INQPSGQIKLTNVSLVRLKKARKRFEVACYQNKVQDYRKGI-EKDLDEVLQI«QVFMNVSKGLVANKEDLQKCFGTTNVDDVIEEI 

Ecu MFTPLNQKKLVUVSIVTLKKFGRRYELAVYPHKLYEYRNGM-RTPLSEILQTDTIYRSVSKGEIARQGDLDLFCRT— HEEIVREI 

Mac MVSLDBAVTARLKRGSKHFEVLVEPEGALAYKRGE-EVNLEDILAVETIFEDAHRGDRAAESDII.NSFETTDPFEIAAVI 

Hnr MI8LDDAVTARLETHGERFEVLVDPDAALEMRRDEFDGELTDVIAARDVFENASRGDRPAESDLETVFGTTEPLEIIPEV 

Mth MVSLEDAVIARLBSHGERFEVLVDPDLAAEFRREDSDVSVEDVLAVQEVFRDARKGDKASEBAMRKVFETADPLEVTPVI 

Mka MARVSLEDAVVARLEKGGERFEVLVDPEGARKFREGE-DVDVEEILAVEQVFRDARKGERASEQAMEELFGTSDPIKVAEIV 

M j a MGRDIMVSLEBAVIARYTSHGEKFEILVDPYLAAXLKEGQ-NVDFDELLAIEVVFRDASKGEKAPEBLLSKI FGTTDVKEIAKKI 

A fU MVSLDKAVIARLRKGGEEFEVLVDPYLARDLKEGK-EVNFEDLLAAEEVFKDAKRGERASVDELRKI FGTDDVFEIARKI 

Pab MPISVDKAVIARLKVHGETFEILVDPYLARDFKEGK-EVPIEEILATPYVFKDAHKGDKASEKEMEKIFGTSDPYEVAKII 

Tac MVKVEDAIVARLESHGYHFEILVDPDAIERIRKGN— IDIENDLAFPEVYKDVRKGEKASDDSLKEAFKTTVI AQVAIEI 

Pae MTKKVAVAKLDKGGEHFEILIDPDAALELKMGK-PLGIDKVLVEEEIYKDAKKGLRASEQALKKVFGTTDVRKIAEII 

Sso MTKERDYVIVKYESHGERFEILAKPKEALAFRSGK-SISLSDVVVSDTIYKDVKKGLKAS PASLKKVFGTTDFETIVKEI 

Ape MAWMEVRGKRFEILVRPELAFRYKBKG-DVDLEDVIWTDTIYRDVRKGIiKASPEEVKKAFGTSDPRRVAEKI 

:*: : ..:•::..:*.: : : 

D97_K98delinsEVQVS R126T R1^9C 

Ath LEKGBLQVAGKERESQFSSQFRDIATIVMQKTINPETQ-RPYTISMVERLMHBIHFAVDPHSNSKKQALDVIRELQKfl— FPIKRSPMRL 

Dme LSKGELQVSEKERQSCLDTQLNSIVNSVAALCVHPETR-RPYPASIIEKSLKDAHFSVKMNRNTKQNTLEAIKILKDH— MPIERSRMKL 

Cel LDKGDLQVSEKERQAASDQSLKEVSQLIASMVVNPETK-RPVPPSVIDKALQEMHFSLKPNRSSKQQALDAI PKLRET LKIERAKMKI 

Mmu LTKGEVQVSDKERHTQLEQMFRDIATIVADKCVNPETK-RPYTVILIERAMKDIHYSVKPNKSTKQQAiEVIKQLKEK — MKIERAHMRL 

Bsa LT RGB VQ VS(iE3B RH TQ L EQMFRD I A T I VAI>KC VN PET K-0P YTVI LI ERAMRD I H Y S VKTNK S T ICQQALE V I KQ LKE K — MKIEQAHMRL 

Ola LAKGELQVS DKERQTQLETMFRDI ATTVADKCVNPETK-RPYPVSMIERAMKDIH YSVKPNKSTKQQALEVIRQLKET- -MEIQRAHMRL 

See MHKGEIQLSEKERQLMLHKVUNEMLTIVSAKCINPVSK-KRYPPTMIHKALQELKFSPVINKPAKLQALEAIKLLVSKQIIPIVRAKMKV 

Ecu LDCGYEQKSEATRVYEQEKTEREIVQILRNKVTRGGRH LSEASLREAIGKVHN— I YVGNSKKQSQEILSKLEKMG FDRVGV 

Mac LKSGELQLTAEQRKRMLBEKKKKVIYTISRNAINPQTR-APHPPARIERAMEEAKVHIDPLKSVDQLVTITMRAIRPL--IPIRFEEIKI 

Hnr IGQGEIGITADQRBAMQQRKKRSLINTISRNAINPQMDGAPHPPDRIESALDEAGFTVDPMTPADEQVDDALEALRPV — IPIRFEEMTV 

Mka IKEGEIQLTAEQRRRMQEEVKRKIIHIIARRAVDPRTG-APHPPERIERAMEEAGVHIDPMKSAEEQVKDVIKQLRPV— LPMKFEEVKV 

Mja ILKGQVQLTAKQRBEIREQKKRQIITIISRHTINPQTD-TPHPPHRIEKAMEELRINIDIYKSABEQVPETVKKLKKV — LPIRFEKRDI 

Afu ILEGEVQITAEQRREMLEAKRKQIINFISRNTIDPRTN-APHPPSRIERALEEAKVHIDIFKSVEAQVKDIVKALKPI — LPLKFEEMEI 

Pab ItRKGEVQLTAQQRREMLEEKKRQIATIIHRHAVDPRTG-YPHPVDRILRAMEEVGVRVDIFKDAEAQVQDVIKAIRRI — LPLRIEMKVI 

Tac VKKGQIQLTTEQRREMYDERRKQIVNLIAREGINPQTN-TPHTPYRISQAMDEAKVKIDPLKPAEDQVQNVLKAIMPI — 1PIRLEKAKI 

Pae IKEGEIPLTAEQRRKLIEDKKRQIVEWISRNCIDVRTK-TPVPPQRVENALEQARVSIDPFKSVEEQVQEVLKEIQRI--IPIKVATARV 

SSO LLKGELPVTAEQRKEMLETKRKQIIDF1HRNAVDPKTN-LPIPPTRLEMAMEQARIQIDLNKDVEAQAMQIVKEISKI — IPIKIARALL 

Ape LKEGEIQLTEEQRRRLLEAKRRQIISYIARNAIDPTTG-RPIPEARIEAALEEVRFPINLWRDAESQAVEAVRLIARV— MPIRLARALL 
:*:*..:: . : : . . : : '• 

I126T 

Ath RLTVPVQNFP-SLLEKLKEWDGSVVSKDES — GTQMSTVCEMEPGLFRECDSHVRSIQ GRLEILAVSVHAEGDTSMDHYDEHDDMAL 

Gar+ RLIVPGQNFH-SLCEKLNEWGATIVSKDES— GTQLSVICE1EPGLFRECDSLVRNLQ— GRLEILSVSVHAEGDTQVDNYDD-EDISS 

Pba+ GLTVSGQNFS-TLLEKLGAWDANVVSKDES — GSRQSIICEMDPGFFRDCDALVRNLQ GRLEILAVSVRFEEDTHVDDYDDYEDVAS 

Dme RVSFAGKBGGGKLKESVVKLAHAVEHEBWD — EATLHLTLLIDPGQYRVTDELVRNETKGKGLLELLELKEVVESEELF 

Cel RVAIPTKEAK-SVHTKLKTLFS DVEVDDWQ — DGSLEMVGLIEPGS FRALDDLVRNETKGHGRLEILSLKDVVEGELQI S 

Mmu RFILPVNEGK-KLKEKLKPLMKVVESEDYS — QQ-LEIVCLIDPGCFREIDELIKKETKGRGSLEVLSLKDVEEGDEKFE 

Hsa RFILPVNEGK-KLKEKLKPLIKVIESEDYG — QQ-LEIVCLSDPGCFREIDEIiIKKETKGKGSIiEVIiNLKDVBBGDEKPB 

Ola * RLQLPAKEAK-RLKEKLKPLLQVVESEEFD— BE-LEMICLVDPGCFRBIDELIRCETKGRGSLEVLSLKDVEEGEEKM 

See KVAISEPSRQPELIEKISKLIASSPGESTKPELDPWTCTGLIDPVNYRDLMTLCDK— KG— TVQVLDMAVIDNTTHN 

Ecu RVSVEMS DKVAE F VKQNGE I H DG YVMIRSDCFPRFKDMCEKEKVR — YLILRREEPEDEEIC 

Mac AVKIPPEYAP-KAYGDISKV-GTITKEEWQG-DGSWIAVVRIPAGVQTDFYALINHLTKGEAQTKLL 

Hnr AVQLPADYAG-SGQAKLREF-GELEREEWQA-DGSWVGVITFPAGMQDEFYGRVNEVSEGNGETSVVKDKDELKTR 

Mka AIRIPAKYTG-QAMGVVRE F-GDIEREEWQY-DGAWVAVVRLPAGLQDEFFEKLNBITKGDFESKILE-RESVEGP 

Mja AVKIPAEFAS-KAYNALYQF-GAVKQEEWQP-DGSLIVLIEIPSGIEAEFYAHLNKITKGNVQTKVVKKYSE 

AfU AIKIPPBHTG-RAISAIiYNF-GGVTREEWQR-DGSWICVMRIPSGMYGDLMDLLGKVAKGEALTKVLRRIG 

Pab AVKIPSBYVG-RAYGEVRKF-GRIKKEEWAS-DGSWLFLIEIPGGVEEEFYEKLNALTKGNAQTKLIERKGL 

Tac AVKLIGDAYG-KLYGELAKS-GYM-KEEHGK-DGSWMGILEVPAGIQGDIIENLSRRGGDKVQIKILKQ 

Pae ALAVSSTYAQ-RVKGLVAKM-AKIVNERYKS-DGSWEALLBLPAGLQDVLIARVUDVTHGDADIRILEIVY 

SSO SIKVPSEYSS-KVKSQLHNL-GEVKKANWLE-DGTLLAELEIPAGAQQDVIDKLNSLTKGEVEVKVLQVR 

Ape EVKIPPPHSG-RAYQALMRM-GEVKKADWLP-DGSLKAELEIPAGAQVEVTSRIQALARGAAEVKVKKVA 



Ath QTHKPI.LPAETET— KDLTDPVVEXSKKLQKQEISTTDNIKQEGGBEKKGTKCSTCNTFVGEAKQYREHCKSDWHKHNLKRKTRKLPPIS 
Gar+ QLPKDSAESASSRLPPESSDSVIQLSEKIQKHTIY— SGNGNAEGEAKQ-HKCSTCNAFVGDSKQYRDHFRSEHHKHNLKRKTRQLPPLT 
Pba+ ALPK BSTDSAVQLSEKIQKQTLS— DEK-KAGAEVK Q-HKCSTCMVSVGDAKQF 

U1 -like zinc finger 

Ath ADECMSEIDMDDSRADLKDYSF 
Gar+ AEECLADVELSDSKTDLQDYSF 
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SBDS cDNA Sequence ID NO:l 

-184 gtaagtaagc ctgccagaca cactgtgacg gctgcctgaa gctagtgagt cgcggcgccg 
-124 cgcactggtg gttgggtcag tgccgcgcgc cgatcggtcg ttaccgcgag- gcgctggtgg 
-64 ccttcaggct ggacggcgcg ggtcagccct ggttcgccgg cttctgggtc tttgaacagc 
-4 cgcgATGTCG ATCTTCACCC CCACCAACCA GATCCGCCTA ACCAATGTGG CCGTGCTACG 
+57 GATGAAGCGT GCCGGGAAGC GCTTCGAAAT CGCCTGCTAC AAAAACAAGQ TCGTCGGCTG 
+117 GCGGA6CGGC GTGGAAAAAG ACCTCGATGA AGTTCTGCAG ACCCACTCAG TGTTTGTAAA 
+177 TGTTTCTAAA GGTCAGGTTG CCAAAAAGGA AGATCTCATC AGTGCGTTTG GAACAGATGA 
+237 CCAAACTGAA ATCTGTAAGC AGATTTTGAC TAAAGGAGAA GTTCAAGTAT CAGATAAAGA 
+297 AAGACACACA CAACTGGAGC AGATGTTTAG GGACATTGCA ACTATTGTGG CAGACAAATG 
+357 TGTGAATCCT GAAACAAAGA GACCATACAC CGTGATCCTT ATTGAGAGAG CCATGAAGGA 
+417 CATCCACTAT TCGGTGAAAA CCAACAAGAG TACAAAACAG CAGGCTTTGG AAGTGATAAA 
+477 GCAGTTAAAA GAGAAAATGA AGATAGAACG TGCTCACATG AGGCTTCGGT TCATCCTTCC 
+537 AGTCAATGAA GGCAAGAAGC TGAAAGAAAA GCTCAAGCCA CTGATCAAGG TCATAGAAAG 
+597 TGAAGATTAT GGCCAACAGT TAGAAATCGT ATGTCTGATT GACCCGGGCT GCTTCCGAGA 
+657 AATTGATGAG CTAATAAAAA AGGAAACTAA AGGCAAAGGT TCTTTGGAAG TACTCAATCT 
+717 GAAAGATGTA GAAGAAGGAG ATGAGAAATT TGAAtgacac ccatcaatct CttcacctCt 
+777 aaaacactaa agtgtttccg tttccgacgg cactgtttca tgtctgtggt ctgccaaata 
+837 cttgcttaaa ctatttgaca ttttctactt tgtgttaaca gtggacacag caaggctttc 
+897 ctacataagt ataataatgt gggaatgatt tggttttaat tataaactgg ggtctaaatc 
+957 ctaaagcaaa attgaaactc caagatgcaa agtccagagt ggcattttgc tactctgtct 
+1017 catgccttga tagctttcca aaatgaaagt tacttgaggc agctcttgtg ggtgaaaagt 
+1077 tatttgtaca gtagagtaag attattaggg gtatgtctat acaacaaaag ggggggtctt 
+1137 tcctaaaaaa gaaaacatat gatgcttcat ttctacttaa tggaacttgt gttctgaggg 
+1197 tcattatggt atcgtaatgt aaagcttgga tgatgttcct gattatctga gaaacagata 
+1257 tagaaaaatt gtgccggact tacctttcat tgaacatgct gccataactt agattattct 
+1317 tggttaaaaa ataaaagtca cttatttcta attcttaaag tttataatat atattaatat 
+1397 agctaaaatt gtatgtaatc aataaaacca ctcttatgtt tatt 



SBDS Amino Acid Sequence ZD NO: 2 

1 MSIFTPTNQI RLTNVAWRM KRAGKRFEIA CYKNKWGWR SGVEKDLDEV LQTHSVFVNV 
61 SKGQVAKKED LISAFGTDDQ TEICKQILTK GEVQVSDKER HTQLEQMFRD IATXVADKCV 
121 NPETKRPYTV ILIERAMKDI HYSVKTNKST KQQALEVTKQ LKEKMKIERA HMRLRFILPV 
181 NEGKKLKEKL KPLIKVIESE DYGQQLEIVC LIDPGCFREI DELIKKETKG KGSLEVLNLK 
241 DVEEGDEKFE 



\ ■ 
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SBDS Exon 1: 

Primer A (SDCR9xlBP) -> 

SBDS qcgtaaaaaqccacaatac gcaggcgt, 

III llllllllllll MINIMI 

SBDSP gcggtaaaagccacaatgcgcaggcgt 

I III III 

MUSBDS aacgacccgccttcctttgaggtgcct 



Primer Q (RTSDCR91P) 

l 

SBDS catcgctcacttttcccctcccggcttctgctccacctgacgcctgcgcagtaagtaags 

llllllllllll III I Mill I III 1 1 MM I IIIIIIIIIMIIMM.III 1 1 MM I 

SBDSP catcqctcacttctcccctcccggcttctgctccacctgacgcctgcgcagtaagtaagc 

i ii i i i i i ii in 

MCTSBDS gggtggaactagagggcgtaaaaagtcacggcgcgcaggcgtggttgctttcttatcggc 



SBDS ctaccaqacacac tcrtcracqcfctqcctgaagctagtgagtcgcggcgccgcgcactggtg 

Ml III I Mill 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 III 

SBDSP ctgccagacacgctgtggcggctgcctgaagctagtgagtcgcggcgccgcgcacttgtg 

II I II I III I I I I Ml I I II I 

MC7SBDS ctagtgcgccacttgacgcatgtgcagtagggcaatcgggcgtgcggtagcttcttccct 



SBDS gttgggtcagtgccgcgcgccgatcggficgttaccgcgaggcgctggtggccttcaggct 

IIIIIIIIIIMM Ill 1 1 1 1 II 1 1 1 1 ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SBDSP gttgggtcagtgccgcgcgccgctcggtcgttaccgcgaggcgctggtggccttcaggct 

I I III I I II II I I Ml I Ml M 

MUSBDS ggtaggttccggaagagccgcgcactccttgggcgttaagggttcgcgcgccgcagggtc 



M S 



SBDS qqacggcgcgggtcagccctggttcgccggcttctgggtctttgaacagccgcgATGTCG 

llllllllllllllllllllllll IMIMMMIIIMIIMMIMIIIIIIIIMI 

SBDSP qgacggcgcgggtcagccctggtttgccggcttctgggtctttgaacagccgcgatgtcg 

I in i i ii ii i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ii 1 1 

MOSBDS gtttcagccgagcacttggcgtcccctcgagctcgagatctgtgaacagccaccATGTCG 



M S 
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I P T P T 



QIRLTNVAVVRMK R 



ATCTTCACCCCCACCAACCAGATCCGCCT 

IIIIIIIIIMIIIIIMIIIIIIIIIIIIMMIIIIIIIIIIIMIIIIIIIIIIII 

atcttcacccccaccaaccagatccgcctaaccaatgtggccgtggtacggatgaagcgc 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II lllllllllllllllll lllllllllll 

MUSBDS ATCTTCACCCCCACCAACC^GATCC 



SBDS 



SBDSP 



IFTPTNQIRLTNVAVVRMKR 



AGKRFEIACYKNKVVGWRSG 
SBDS GCCGGGAAGCGCTTCGAAATCGCCTGCTACAAAAACAAGGTCGTCGGCTGGCGGAGCGGC 

III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ! I IIIIIIIIIIMIIIIIIMIIIIIIII 

SBDSP gccaggaagcgcttcgaaatcgcctgctacagaaacaaggtcgtcggctggcggagcggc 

I llllllllllllllllllllllllll I lllillllllllllllllllllll III 

MUSBDS GGAGGGAAGCGCTTCGAAATCGCCTGCTATAAAAACAAGGTCGTCGGCTGGCGGAGTGGC 



GGKRFEIACYKNKVVG 



R S 



128 



SBDS 



SBDSP 



MUSBDS 



GTgtgagtagccccctccctcgggcctgggcctgggcctgagccgtcacctccgaggcgg 

IMIIIIIIIIIIIIIIIIIIIIIIIilllMlllllllllllllllllllllllllll 

ttgtgagtagccccctccctcgggcctgggcctgggcctgagccgtcacctccgaggcgg 

iiiiniii n i ii i i ii ii i i ii i i 

GTgtgagtaatcctgtgcccagagttcggcggcctggcctccctaaccccggctcctgcg 



SBDS 



SBDSP 



MUSBDS 



cctgtctctgcccaagtcgagtgaatgggccaggctggggtgtt ggccggggagga 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin mi iiiini 

cctgtctctgcccaagtcgagtgaatgggccaggctggggtgtttgttggcccgggagga 

i hi i ii i ii i ii ii 

acccatcggtacctttcaggcctggtttacccgattcgfgattgggttctgctttgggatt 



SBDS aatggaacattcctgctgtgagcatgagacgtcgctgtccgagcttggcgcctaagccaa 

M 1 1 M 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 i E 1 1 1 1 1 1 1 1 1 1 1 1 L [ 1 1 1 f ! I ! I 

SBDSP aatggaacattcctgctgtgagcatgagacgtcgctgtccgagcttggcgcctaagccaa 

i n n ii n mi n i 

MUSBDS ttgttagtatcataaaaactgccaactacaaacgccatcagagccgggtgggaccgatgg 



<- SDCR9xlseqRev 

SBDS aaatttct tctttatttgqttcrcrttccrq attqqqttqttqqtttgqggttttgttttgtt 

111111111 INI I MINIM I INI Ml I INI 1 1 MM MINIM I Ml I II 

SBDSP gggtttctt tatttggttggttccgattgggttgttggtttggggttttgttttgtt 

i i i i i i i i i i i in 
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MUSBDS tttaggcctgtaatcccagcgcccaggaaactgaggcaggaggattgctgcgatttccag 



SBDS ggtgtcataaaagctgcagccaagaaatctcgtaattgtggtccttttcctagaataatg 

lllllllllllllllllllllllllllllll llll Illllllllllllllllll 

SBDSP ggtgtcataaaagctgcagccaagaaatctcataattgtggtccttttcctagaataatg 

I i i ii n i I ii 1 1 1 1 i M 

MUSBDS gccagcctggaacgtgtgtgtgtgtgtatgtgtatgtgtgtgttgtgtgtgtgtatgtgt 

Primer B (SDCR9xlBR) 

SBDS atgorctaagaa cctaatcttaccraatactqtcataqr 

1 1 1 M 1 1 II 1 1 1 M ! 1 1 II lllllllllllllll 

SBDSP atggctgagaacctagtgttccgaatactgtcatag 

in n i " n i 

MUSBDS atgtgtgtgtgagagagaccgtgaccgaccctgtac 



SBDS Exon 2: 

Primer E (SDCR9x2BF) -> 
SBDS aaatggtaaqqcaaataccrq ttctqaqttttaaaaatgttccctcaggccgatgcgggca 

llllllll MINIMI IIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIII 

SBDSP aaatggtagggcaaatacagttctgagttttgaaaatgttccctcaggccgatgcgggca 

i ii i in i ii I i ii 

MUSBDS gtagtgtcttcgctactgccatctagggacagatattccaggacagaagaaacaccactc 



SBDS gttcacttgaggccaggagttcgaggccagcctggccaacatgaaaccccatctctacta 

. i iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii.iiiiiiiiiiii 

SBDSP gatcacttgaggccaggagttcgaggccagcctggccaacatgaaacaccatctctacta 

i n i i i i in i 

MOSBDS cccaccacaccctgagtttccttacataaaacaatgatgtagtttttccctctgtggtga 



SBDS aaaatacaaagttagccgggtgtggtggcgcatgcctgtaatcccagttactcaggaggc 

llllllllll IIIHIIIIIIIIIIIIIIMIIIIIIIIIIIIIII llllllllllll 

SBDSP aaaatacaaaattagccgggtgtggtggcgcatgcctgtaatcccagctactcaggaggc 

I II II Mi ll I II 

MUSBDS agtgggagaatccagatactgtccttcgcaggtagccaccagagagagagtgtggtgtgt 



SBDS tgaggcgggagaatcacttgaacccgggaggctgaggttacagtgacccgagatcgcgcc 

1 1 1 1 1 1 II MIM M III MM || III Mllll llllllll 

SBDSP tgaggcaggagaatcacttgaacccgggaggcggacgttgcagtgagccgagatcgcgcc 

i i mi i i i mi ii i 

MUSBDS gtgtgtgtgagatttctctttttttttttctttagggtttttgttttgtttttttttgtt 



ill, *of/o*% 
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SBDS 



SBDSP 



MUSBDS 



attgcactccagcctgggcaaaaacagtgaaattccatctaggggcgggggttggggggt 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mini mm 

attgcactccagcctgggcaaaaacagtgaaattccatctaagggcggg gggggg- 

| I III II II M I 

ttgtttggttttttttttttttttttttgagactggcctcaaactcccaatttccctgcc 



SBDS 



SBDSP 



Primer C (SDCR9/SDCR9Lx2) -> 

BR rr aaaaa rf aaaa ri-gprrhrhar.a rhaaaggt cat caqqqqqat t tat t crt qt c t tgcc 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIII 

F lPg aaa r a ^^gr»^^^^^ar!ar■i•aaaggtGatcaqqqqqatttqttqtqtcttqcc 



MUSBDS tctgcctcctaaatggtgagttacagatgtgcacatcacacccagcttgcagcacttgcc 



SBDSP 



MUSBDS 



Primer 0 (SDCR9/SDCR9Lx2-3F)-> 

^^^a^r^t-^^g^^a^r'^nghaht-.t-.aaatata aatqcatqtccaaqtttcaaq tatatt 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIMIII 

jf-t-^ai-g^-t-gt-<-gnr>at-f7hrTg^ahthaaatata aatqcatqtccaa qtttcaaqtatatt 

II I Mill Mil II MINIMI I II II I I I I 

atttctcttgttgctatcttgtgtttaaatgtgagtggattttcttactatccagtggat 



129 
I 

VEKDLDEVLQ 
SBDS cacataggactttctctcctgccctcacaagGGAAAAAGACCTCGATGAAGTTCTGCAGA 

MilllilllllllllMMIMIIIMI IIIMIIIIIIMI MINN IMIIIIII 

SBDSP cacataggactttctctcctgccctcacaagggaaaaagaccttgatgaagttctgcaga 

lllllllllllllllllllllllll llllllll 1 1 MINN NJNJIJI 

MUSBDS cacataggactttctctcctgccctttcaagGGAAAAAGACCTTGATGAAGTTCTGCAGA 



VEKDLDEVLQ 



THSVFVNVSKGQVAKKEDLI 
SBDS ^CACTCAGTGTTTGTAAATGTTTCTAAAGGTCAGGTTGCCAAAAA^ 

lllllllllllllllllllllllll llllllllllllllll lillllllllllllll 

SBDSP cccactcagtgtttgtaaatgtttcctaaggtcaggttgccaagaaggaagatctcatca 

Mil IIIIIMIMIIIIIIIMI lllllllllllllllll llllllll lllllll 

MUSBDS CCCATTCAGTGTTTGTAAATGTTTCCAi^GGTCAGGTTGCCMGAAGGAAGACCT<^T^ 



TH SVFVNVSKGQVAKKE 



258 
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I 

SAFGTDDQTEICKQ 



SBDS GTGCGTTTGGAACAGATGACCAAACTGAAATC^ 

1 1 1 M 1 1 M 1 1 1 1 1 E ! 1 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 MINIMUM Mill 

SBDSP gtgcgtttggaacagatgaccaaactgaaatctgtaagcaggcgggtaacagctgcagca 

MM MMI MMI MMI MMMMMI MMMM Ml II III 

MUSBDS GTGCATTTGGGAC&GACGACCAGACTGAA^ 



SAFGTDDQTEICKQ 



SBDS tagctaaccctaataaccatttataacgtatttgtagatatattaaacattaaaggctgt 

MIIMMMMMIMIIMIIIIIIIMMMIMIMMIMMIIIIIMIIIMI 

SBDSP tagctaaccctaataaccatttataacgtatttgtagatatattaaacattaaaggctgt 

ii i ii ii ii i i i i i 

MUSBDS atgtaacaaaatctcacgatggtaggcaacatctggaccactgtgtttactgtttttctt 



4- Primer D (SDCR9/SDCR9Lx2R) 

SBDS " ttttctgaaqgaaaqa ctaaccaaacaataatataaactacacacrtatcacttctaataa 

IIMMMIIIMIII MM1MI I IMMMM MMM MM I lllllllllllll 

SBDSP ttttctgqaqqaaag actaaccaaqcaataatqtqaactgcacaatatcacttctaataa 

I I I I II I II Mi 

MUSBDS gatgagtttttgttgttttagcatttgttgggtccctcccacctccagtttatattgttg 



4- Primer F (SDCR9at2BR) 

SBDS taaagaacttgqt 

lllllllllllll 

SBDSP taaagaacttggt 

I II 

MUSBDS ggcaatttgggga ~ 



SBDS Exon 3: 



Primer G (SDCR9x3BF) SDCR9x3CF 

-> 

SBDS gctcaaaccattacttacatattqa taqctqqaaaqqatqaaatttaat tttctctccat 

lllllllllllllllllllllll I II II 11 1 1 1 1 1 II M 1 1 1 1 1 1 It 1 1 1 1 1 1 1 Ml 

SBDSP gctcaaaccattacttacatattaatagctggagaggatgaaatttaattttctcccca- 

ii i i i i i ii ill 

MUSBDS tgtaagctgctgctgggttaaggcagcacgtggttctgcgtgagcagctgcagtggacgc 



SBDS ccagttactcattttttatggttagttaataaatagtgtgtgatagagaaagatagtgat 

IIIIIMIIIMII I IIIIIIIMIIIIIIIIIIIIIIIIMIIIIMIIMIII 

SBDSP - - -gttactcattttttgtcgttagttaataaatagtgtgtgatagagaaagatagtgat 
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I II I II II 

MUSBDS cgcctcccttcctccccgctacctacctgtgcagtagagagatacccagaactgatgagg 



259 
I 

ILTKGEVQVSD 



SBDS ttcttaaatgtgttggcatttttttagATTTTGACTAAAGGAGAAGTTCAAGTATCAGAT 

llllll IIIIIIIIIIIIIIIIIIIIIIIMIIIIIilllllllllllllllllllll 

SBDSP ttcttaactgtgttggcatttttttagattttgactaaaggagaagttcaagtatcagat 

n i mi i iiiiMiiiiiiimiiiiiiiiiiiiiii iniii 

MUSBDS gctttctctatgttctgccatctttagATTTTGACTAAAGGAGAAGTTCAAGTGTCAGAT 

~I L T K G E V Q V S D~ 



Primer T (RTSDCR93P) -> 
K E R H T Q L 



QMFRDIAT 



V A D 



SBDS AAAGAAAGACACAGACAACTGGAGCAGATGTTTAGGGACATTGCAACTATTGTGQCAGAC 

Mill IIMIIIIMIIIIIIIMIIIMIIIIIIIIIIIII illllllllllll 

SBDSP aaaga cacacacaactggagcagatgtttagggacattgcaattattgtggcagac 

llllll I llllllll IIIIMIIIIIIIIIIIIII II II II MINIUM 

MUSBDS AAAGAACGGCACACACAGCTGGAGCAGATGTTTAGGGATATCG^ 

~K E R H T Q L E Q M F R D I A T I V A 5". 



K 



N P E T K R 



V .1 



E R A M 



SBDS AAATGTGTGAATC CTGAAACAAAGAGAC CATACAC CGTGAT C CTTATTGAG AGAGC CATG 

M|| Mill 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 

SBDSP aaatgtgtgactcctgaaacaaagagaccatacaccgtgatccttattgagagagccatg 

II llllllll II llllll I III MM II III II I Mill M III III II III I 

MUSBDS AAGTGTGTGAACCCAGAAACAAAGAGACCTTACACCGTTATCCT 

~K C V N P E T K R P Y~."t V I L I E R A M~ 



459 

Primer S (RTSDCR93R) | 
KDIHYSVKTNKSTKQQ 



SBDS AAGGAGATCCA CTATTCGGTGAAAACCAACAA GAGTACAAAACAGC^ C 

MIIIIMMMIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIMIMIIMIIIIIIII II 

SBDSP aaggacatccactatttggtgaaaaccaacaggagtacaaaacagcaggtgagtggtctc 

I III 1 1 II III I MM I'M Ml II Mill Mill Mill II Mill II I II 

MUSBDS AAGGACATC CACTACTC CGTGAAACCCAACAAGAGCACAAAGCAACAGgt aagggt t C C t 
~K D I H Y S V K P N K S T K Q Q~ 



SBDS 



SBDSP 



<- Primer P (SDCR9/SDCR9Lx2-3R) 
tcat gtcatcaaaatataaccatcrcra aatcaabtttctctaaaaaaatcattaaaataat 

1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 M 1 1 1 ! 1 1 1 1 1 1 

tcatc rtcatcaaaatataaccatqqa aatcaattttcfcctaaaaaaatcattaaaataat 

I MM II III I I I I I II II 
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MDSBDS tgttgtcctcgggacctaaggccatggaagtgcctgatgcgcctgcctccctatctctgg 



SBDS gggtctggggccaggcacaatggttcatgcctgtaatcctagcactttgggagccaagat 

lllllll! I IMIIIIMIMIIMIII II llllllllllllllllllllllllllll 

SBDSP gggtctggggccaggcacaatggttcatacccgtaatcctagcactttgggagccaagat 

I I II INN I II II ■ ' I II I M 

MUSBDS- tgctggggtcagcagcacacacttccaggctgcctggctgtgctggtgctcatcattctg 



SBDS gggaggattgcttgaggcctggaaacagcctgggaaacatagggacgccccatctctaaa 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin 

SBDSP gggaggattgcttgaggcctggaaacagcctgggaaacatagggacgccccatctctaaa 

i n ii mi i i i in i i i i ii 

MUSBDS agcagaccctctcccggctgagccatacccttagctgctgctcctcagtgtgacggaaca 



SBDS ttttttttttt tttttt tgagacagagtcttactctattgcccaggctg 

iiiiiii 111 i ii i ii 111 nun 111 in i nniiiiini 

SBDSP tttttttgtttattgttgtttttttgtttgagacagagtcgcactgtgttgcccaggctg 

I I I I I I I I II I 

MUSBDS caaatacacacagaactctttttgtttgtttgtttgtttgggggtttttttttttttttt 



SBDS gagtgcagtagtatgatctcggctcac-tacaatctccacctcccgcgttcaagcaagtc 

iiiiinii i i iiiiiiiniiii iiiiiiiiiiiiiiiiii.iiiiniiiiiiii 

SBDSP gagtgcagtggcacgatctcggctcacttacaatctccacctcccgcgttcaagcaagtc 

i i 111 ii mi ii ill 

MUSBDS ttagttttgtttttggtctttcgagacagggtttctctgtattgccctggctgtcctgga 



tcctgcctcagcctcctgagtagctgggattataggcacgtgccaccacactcagctaat 

llllllllllllllll MM MM INI Mlllll III llllllllll II 

SBDSP tcctgcctcagcctcccaagtagctgggattataggcacgcgccaccacacccagctaat 

III " II I I II II II I III I III 
MUSBDS actcgctctgtagcccaggctggcctcgaactcagaaatccgcctgcctctgcctcccaa 



SBDS tttg-tatttttagtagagttgaggtfctcaccatgttggccaggctggtcttgaactcct 

1 1 1 1 iiiiiiiiiiiiiiiiiiiini iiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

SBDSP tttgttatttttagtagagttgaggttttaccatgttggccaggctggtcttgaactcct 

i mi i 1 1 1 1 ii i ii 

MUSBDS gtgctgggattaaaggcgtgggccaccacacctggctcatacagaactcttatttcctgc 



SBDS gaccctaggtgatccgtccgccttggcctcccaaagtgctgggattacaggcatcagcta 

. llll 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! M I M M II 1 1 1 ! 1 1 1 1 M I i 1 1 M I M I 1 1 1 1 ! 1 1 M 

SBDSP gacctcaggtgatccgtccgccttggcctcccaaagtgctgggattacaggcatcagcta 
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III I I I I III 

MDSBDS ccagctcaaacctttaaagagaaagcttggactttgagtcacctgagcccttttgctgtt 



SBDS ccgtaccctacctctaaattttttaatataaaaaattaaatttaaaaaaatgggtctgca 

Illllllllllllllll IIIMIIMIIIMIIMIItMIIMIIMMIMI 1 1 1 1 

SBDSP ccgtaccctacctctaatttttttaatataaaaaattaaatttaaaaaaatgggtttgca 

i ii i i i i i i i i i i i 

MUSBDS tgtgtttattaacatatttcctacagctcagccctgtcacgccagccattctgctggcct 



<- Primer H <SDCR9x3BR) 
SBDS taaaacycaaqtq 

iiiiiiiiiin 

SBDSP tggaagcaagtg 

I I I II 
MUSBDS ggattccaagca 



SBDS Exon 4: 

Primer I (SDCR9x4CP) -> 

SBDS aaagggtcattttaacacttc tttttgaattttttaatttatatataattcacataccat 

III I MINI I II I II 1 1 1 1 Mil I III I Nil llllll II II Mill II MM II 

SBDSP aaagggtcattttaacacctctttttgaatttttcaatttacatataattcacatacaat 

i i i I I i ii i i i ii mi i 

MUSBDS ctcaaaagaaataacaagtcgggtgtggtggtgcacacctttaatcccagcactcgggag 



SBDS aaatttcacactcataaagtatgtacactttaagtggtatattaacaaagttttggaacc 

1 1 1 ii mm h Jinn i 1 1 1 1 iii i mi ii ii! i ' iii i ii tin 1 1 mill 

SBDSP aaatttcacactcataaagtgtgtacactttaagtggtatattaacaaagtttgggaacc 

i i n ii i . i i n 

MOSBDS gcagaggcaggcgaatttctgagttggaggccagcctgagttccaggacagccagggcta 



SBDS ttccctgctacctggttcgagaacattttcatcaccacaaaaagaaagtcagtatccatt 

Illllllllllllllll I II I I I I I I I II I I I I I I I II I I I I I I I I I M I M I I M I I I 
SBDSP ttccctgctacctggtttgagaacattttcatcaccacaaaaagaaagtcagtatccatt 

I I I III II II I III Mill III I I I 

MUSBDS tacagagaaaccctgtctcgaaaaaccaaaaaaaaaaaaaaaaaaaaaaaaagaaggaag 
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SBDS 



SBDSP 



MUSBDS 



agtagccatcccccattttccccccacaggcccctcccaaccactaatctcctctcgtta 

ill.lll IIIIIIMIIIIMIIIIIIIMIII lllllllllllllllllll llllll 

agtagctatcccccattttccccccacaggcccttcccaaccactaatctcctgtcgtta 

i ii i n i i i i i i i 

aaagaaagaaagcaagcaagcaagcaagcgagcaatggtgtttcacagcacgaagtatag 



SBDS tggacttctcaattctggacatttcatataaatggaatcatacaatatgtggccttttca 

lllllll 'llllllllllllllllllllllllllllllllllllllll lllllllllll 

SBDSP tggacttgtcaattctggacatttcatataaatggaatcatacaatatatggccttttca 

i i i ii i mi iiiiin 

MUSBDS tatgacccatataactaacagcctgcctgagttattactgcttaggcagtggcctgactt 



SBDS tggttcatacatgttgtaacctgcatcagcatgtcatttcttttttatgccggaataata 

II MM 1 1 II I MM INI Mil I I Mli III II III lllllll lllllllllll III 

SBDSP gggttcatacatgttgtaacctgcatcagcatgtcatttcttttttatgccggaataata 

I inn i i i in 

MUSBDS agacctgatcatgtacgtccagaaaaggcctggtggaaaactggaaggagccagagaaga 



SBDS gcccactgtacggaaagaaacacattttgttcattcatctatcagttgatagacattggg 

iiiiiiiiiiiinii mil iiiimmimi imiimmiiiiiiiii 

SBDSP gcccactgtacggaaaaaaacatattttgttcattcatttatcagttgatagacattggg 

II I II I I II I III 

MUSBDS acctccatacacaagaactctgggcaacctcagaactactcatgtccattccacaaccca 



SBDSP 
MUSBDS 



ttgctttcacttttgagctatgatgagcaatgctgctataaaatttcttgtatgtttctg 

llllllll MIIIIIIIIMIIIIMIIMIIMIIIIIIII IMIIIIIIIMIII II 

ttgctttcacttttgagctatgatgagcaatgctgctataaaatttcttgtatgtttttg 

ii ii ill 

accaggggcttctctgtacagggaacaagcacaggagagtcatcaagggactaacgagct 



SBDS tgtagacatatgttttcatttctgtatacctggtgactaccaaacctatttctaaaacag 

iimiimi mmmimimim miimmiiiiimiiiim 

SBDSP * tgtagacatatattttcatttctgtatacctggggactaccaaacctatttctaaaacag 

ii i i i i ii i i i i 

MUSBDS cacatcgaccacctgtgcactgttcccctctccataaacctcagattgcacaagctcagc 



SBDS ctgcaccattttactttaccaccatcagtgtttaagagttcagtttctccacatcctcag 

1 1 1 1 1 1 1 1 1 E M 1 1 1 1 1 1 1 1 1 1 1 III ! 1 1 1 1 E I i 1 1 1 1 1 1 1 1 E E 1 1 1 1 1 1 1 1 1 1 1 1 1 

SBDSP ctgcaccattttacattaccaccaacagcgtttaagagttcagtttctccacatcctcag 

ill i i i i iniiii ii ii • i II 
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MUSBDS ccccgtctcctccacatccagctgccagtgactgacgctgcctgcgggtcagtggcagag 



SBDS 



SBDSP 



MUSBDS 



taatacttgtcattgtctgcctttttgatgatggccatcctggtggtatcttgtcgtggt 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III MM llll Mil MM II II II MM II Mil II II 

taatacttgtcattgtctgtctttttgatgatggccatcctggtggtatcttgtcgtcgt 

• i -i- -i iiii in in i il l- 

gtgccaaggcaaaggcctgtgaggaccttactgtgtatcactaggcgtcccagcactctg 



SBDS 



SBDSP 



MUSBDS 



tttgatttgcatttccttaatgatgatttgagcatatttccatgtgcttattggtgcctc 

lllllllllllllllllllll M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 M 1 1 1 M 1 1 1 

tttgatttgcatttccttaataatgatttgagcatatttccatgtgcttattggtgcctc 

in i in i ii i i i i i i i in 

gatgactgttattagactttcagggaagccactagttcttctacccagtgacagcttctc 



SBDS gtctgtcttcttttgagaaatctctgttcaggttctttgccc a c-c-c 

II llllll III II I II II II MINI III I UN MM I II III 

SBDSP gtctgtctgcttttgagaaatctctgttcaggttctttgccccctttttattctcgctct 

in i n i 

MUSBDS aggcacgggtgtccacagagtgggaagggccttgctggacggctggtgggaagctctggg 



SBDS — c-ccc c gc c-c — tct t-tttgcaaactctgcctcccgga 

I Ml I II II III I lllllllllllllllllllll 

SBDSP gtcacccagactagagtgcagtggcgcgatctcggctcattgcaaactctgcctcccgga 

III III III 

MUSBDS ccattttcccaaggagcatgtctctgctctcaccactgttagaattactgtgaactcagc 



SBDS ttcaagcaattctcctgcctcagcctcttgagtagctgggattacaggcgtgcactacca 

1 1 lllllll limil I II II I MM I III I II II II II I I II II MM I llllll 

SBDSP ttcaagcaattctcctgcctcagcctcttgagtagctggtactacaggcgtgtgctacca 

I II MM II I I IN I I I 

MUSBDS tatgggctcaggtcctcaaggttcatggcttaaaacagggttggcttagaagtctccgag 



SBDS cacccggctaatttttctttttttgtatttttagtggagacggggtttcaccatgttggc 

1 1 1 1 1 1 [ 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 ! Ml: MIMMMIMMIMMIIMI 

SBDSP cacccggctaatttttctttttttgtatttttagtagagacggggtttcaccatgttggc 

i ii i ii i i i i i ii in 

MUSBDS gccaacaaaaagacattttgtctgttctagagatgtacgaaattcccaccgcacacattt 



SBDS caggctggtctcgaattcctgaccttgtgatgcacccgcctcggcctcccaaagtgctgg 

MINIMI UMIIII IMMIMIMMMMIMIMIIMIMIMIMIIIMI 

SBDSP caggctggtctcgaatttctgaccttgtgatgcacccgcctcggcctcccaaagtgctgg 

i i ii i i i ii ii i I 

MUSBDS tcttgcttttagagagctgaggacagcccaggtcctcgtgcatgctgggtagttgcttca 



WO 2004/020658 



PCT/CA2003/001320 



16/22 



SDCR9x4seqB -> 

SBDS aat:tacaggcgtaagccaccacacct aaccttcactttcttcatacrt ttttt<Taaacaca 

mi iiiiiiiiiiiiiiiiiiiiiiiHTiiiiiiiiiini iiiiiiiiiiini 

SBDSP gattagaggcgtgagccaccacacctggccttcactttcttcataattttttgaaacaca 

i i i in i M i ii . 

MUSBDS " ■ ccactgaactgagtcccagcctttaacgttgctttctgccgaagcaaaaattattttttt 



SBDS aaagcttttcttcttgataagtccaatttttctattttttttttaacggtcacttatgtt 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin iiiiiiiiiiimiiiimiiii 

SBDSP aaagcttttcttcttgataagtccaatttttcta-tttttttttaacggtcacttatgtt 

III I I I I I I II I III II I. 

MUSBDS ttccatttcacaaaatgagacactagctcattttttaggtatttctaggattgctggtac 



SBDS cttaatgttatacctaagaaaccattacctaatccaactacatggaaactactttgtttt 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIMIIIMIIII 

SBDSP cttaatgttatacctaagaaaccattacctaatccaactacatggaaactactttgtttt 

III II I II II I II .Mil 

MUSBDS cttggctgtaaaactgctggcataaggcagctatgtggaaactgctttgttcatgtctaa 



460 



SBDS tgaaaaccttatgaaataatatagtagaagaaattgcattctcgattttgtcttggtagG 

I I I I 1 I 1 1 I I 1 I 1 I I I I 1 1 1 1 I I I I I I I 1 1 I I I 1 1 I f I 1 I I 1 I 1 1 ) I 1 I I 1 I 1 I I I 1 i I i 
SBDSP tgaaaaccttatgaaataatatagtagaagaaattgcattctcgattttgtcttggtagg 

I I II I I I I III II Mill Ml 

MUSBDS catataaatttgtgcagcacaaaaactaagtaacgagcaccccttgttctgtcttaaagG 



ALEVIKQLKEKMKIERAHMR 
SBDS CTTTGGAAGTGATAAAGCAGTTAAAAGAGAAAATGAAG^^ 

i 1 1 1 ) 1 i 1 1 1 1 1 [ 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 

SBDSP ctttggaagtgataaagcagttaaaagagaaaatgaagatagaacgtgctcacatgaggc 

1 1 1 i 1 1 1 m 1 1 1 1 m j 1 1 1 1 i ilium 1 1 1 1 1 1 1 1 1 1 ) ii ii niiii i 

MUSBDS CTTTGGAAGTGATAAAGCAGCTGA^ 

A L E V I K Q L K I K M K I I R A H M R 



LRFILPVNEGKKLKEKLKPL 
SBDS TTCGGTTC^TCCTTCCAGTCAAT^ 

III II IN 1 1 1 il Mill II II II 1 1 1 II 1 1 II lllllil 1 1 II MM I MM II IN 

SBDSP ttcagttcatccttccagtgaatgaaggcaagaagctgaaagaaaagctcaagccactga 

i n iiiinii inn ii inn inniinn n inn iinniin 

MUSBDS TGCGCTTCATCCTGCCAGTGAACGAAGGGAAGAAGCTQAAGGAGAAGCTGAAGCCACTGA 
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LRFILPVNEGKKLKEKLKPL 



SBDS 



SBDSP 



MUSBDS 



KVIESEDYGQQLE 



624 
I 



TCAAGGTCATAGAAAGTGAAGATTATGGCCAACAGTTAGAAATCgtaagagtcaaatatt 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIMI II II IMIIII III II INI MM IMM II Ml I 

tcaaggtcatagaaagtaaagattatggccaacagttagaaatcgtaagagtcaaatatt 

I Mill I II Mill II II MM III I II IIIIMIII I 

TGAAGGTGGTGGAGAGTGAGGACTACAGCCAGCAGCTGGAGATCgtaagatgatggtggc 
M K V V E S 1 D Y S Q Q L E f 



SBDS 



SBDSP 



MUSBDS 



ttctttgcttcatgttacctaaatattgtattctctagtaataaatttgtagcaaacatt 

1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1! 1 1 1 1 1 1 1 1 

ttctttgcttcatgttacctaaatattgtattctctagtaataaatttgtagcaaacatt 

i i ii i in hi n 

ggggagcaggtggcgcagccaaggtcccatgattatgaccttaacacattattattcttg 



Primer J (SDCR9x4CR) 
SBDS taaa tcrt t crtaaac - gtcagatat t tt c 

in iniiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 

SBDSP cagacattgtaaacagtcagatattttc 

n ii i n in 

MUSBDS gcttccttctacccaaatagcctcgttc 



SBDS 



5: 



Primer K (SDCR9x5CF) -> 
SBDS tccactcrtaqatqtcraactaactc atctqacactacttqaaattctaaaatctttqcaaa 

llllllllllllllllllllll I III I! Mllllllll MM IIIIMIII MM! Ill 

SBDSP tccactgtagatgtgaactaacccatctgacactacttgaagttctaaaatctttgcaaa 

i ii i i i i i i ii 

MUSBDS gtatactgtggctgtcttcagacacagcagaaggcatcggatcccattacagatggttgt 



SBDS 



SBDSP 



MUSBDS 



actgtacacatgggccaggcacagtggctcgtgcctgtaatcccagcactttgggaggcc 

IIIIMIII ! Mill IIIMIIIIIII M I MMMIIIMIMlllMMIM III 

actgtacacgtgggccaggcacagtggctcatacctgtaatcccagcactttgggaggcc 

II MM I I II I I I 

gagccacttgtggttgctgggaattgagctcagaacctctggaagagcagccagtgctga 



SBDS 



aaggtgagcagataacatggtgaaaccctatctctactaaaaatacaaaaaataagccag 

mi MMiiiiiiii Miiiiiiiii iiiiiiiiiiiiiiiiiiiiiiiiiiiiii 
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SBDSP 



MCJSBDS 



gaggcgagcagataacacggtgaaaccctgtctctactaaaaatacaaaaaataagccag 

I I I I I I I I II I 

gcatctctacagcctctgaacccagggtcttgatgctaagcagtgctcactctcagtatg 



SBDS gtgtggtggtggg-ttcctgtaatcccagtttcttgggaggctgaggcaggagaatcact 

iiiiiiiiiiin i imiiiimii iiiiiiinii nitiiiiiiiiiiiii 

SBDSP gtgtggtggtgggcgt-ctgtaatcccagtgtcttgggaggccgaggcaggagaatcact 

ii i i n i ii 

MCJSBDS agctgcagcactggccaggtgagtcttcaagggtgtcttaatcaggcttttactgctgtg 



SBDS tgaacctgggaggcggaggctgcagtgagccaagatcacaccactgcactctatctc-aa 

lllllllllllll IIMIMMIIIIIIIIilllllllllllllllllllllllll II 

SBDSP tgaacctgggaggtggaggctgcagtgagccaagatcacaccactgcactctatctcaaa 

I Mi l Mil II I I I I 

MUSBDS aacagacaccaggaccaatgcaagtcttataaagaacaacatttagttgagtctggctta 



SBDS aaaaaaat--aa-attaaeatacacatggtgtctacataagtcttcacattgctttttct 

llllllll II I IIIIIIIIIIIINIIIIII MMIMMIIII IMIIIIIII 

SBDSP aaaaaaataaaacaaaaacatacacatggtgtctacgtaagtcttcacattgctttttct 

i i i i i ii i i i 

MUSBDS caggttcagaggttcagtccattatcaaggtgggagcatggtagtatccaggtgggaatg 



SBDS ccttcatacgtggaggtgactttactgagctataaaatgtaatgctaaattttagtatga 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniii 

SBDSP ccttcatacgtggaggtgactttactgagctataaaatgtaatgctaaattttagtatga 

mi in n i i in 

MUSBDS atacaggaggggctgagagttcgacatcttcatctgaaggctgctagcagaatactgact 



SBDS 



SBDSP 



MUSBDS. 



gaagaatcagagttttctagtttgtcccttccatttacagctgaagaatcagaataagtg 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII M 1 1 1 1 1 ! 1 1 1 1 M 1 1 1 1 

gaagaatcagagttttctagtttgtcccttccatttacagcggaagaatcagaataagtg 

i mi ii ii n i i i in i 

tcgaggctgttaggatgagggtcttaaagcctatgaccacagggacacaccttctaatag 



SBDS tttaaacatagggattaatgccttgtcacagggggctacatggacacttgagggcagagg 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin iiiiiiiiiiiiin 

SBDSP tttaaacatagggattaatgccttgtcacagggggctacatggatacttgagggcagagg 

in ii i i i i i i i in 

MUSBDS . tgtcactccccgggctgagcatatacaaaccgtaacacgggataagtgcctttcccaaag 
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SBDS 



SBDSP 



MUSBDS 



ctaaactggaacccagtgtgccgccctacccattgtcttatctattgcaccatagaactg 

II IMIMMIM Mill MMMMIMM I MMIIIMIIIIMMIMIIMM 

ctgaactggaacccagtgtgccgccctacccattgtcttatctattgcaccatagaactg 

in i i i i i i 

tccaacagtaggtgcttagaatcgagacagaaccccaggcccagcctgctgccctggcct 



SDCR9x5Fseq -> 

SBDS tggtattattagagatctgqacagcattqt qcttqcctcaaaqqaaqtt aaaqctgaqtt 

llllllll lllllllll MIIIIIIIMIIIIMMIMI lllllllllllll 

SBDSP tggtatta gagatctggacagcattgtgcttgcctcaaag ttaaagctgagtt 

i i i n i in i i i i i i i mi i ii 

MUSBDS ccatgtgagcagcacctagaacacagtcatagatctgccctgagcattcaaactgggctt 



625 
I 

V C 

SBDS tattctgtgtcttgctcatcctcatgtggtaatctgctacgttaaatgtttcagGTATGT 

MIIIIMMIIIII IIIMIIMI Mill 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 1 M 1 1 1 1 1 i 1 1 

SBDSP tattctgtgtcttgctcatcctcatttggtaaactgctacgttaaatgtttcaggtatgt 

i in 1 1 1 1 n mi ii i in 1 1 1 1 ii inn ii 

MUSBDS attctgtgccgatgcccatcttcccttggaaaccagctgtgttactcattgcagGTGTGC 

~V C~ 



D P G 



F R E 



E L 



K K E . T K G 



SBDS CTGATTGAC CCGGGCTGCTTCCGAGAAATTGATGAGCTAATAAAAAAGGAAACTAAAGG C 

M 1 1 1 J 1 1 1 1 IMmIIII MIIImI IM III III MIIIMIIIIMI! Mill 

SBDSP ctgattgacctgggctgcttccgagaaattgatgagctaataaaaaaggaaaccaaaggc 

ii n inn iiinnii illinium nun inn nm i ii mm 

MUSBDS CTCATCGACCCAGGCTGCTTCAGAGAAATTGATGAGCTAATAAAAAAGGAAACGAAAGG^ 
~L I D P G C F R E I 5 E h I K K E T K G~ 



750 

I 

KGSLEVLNLKDVEEGDEKFE 
SBDS AAAGGTTCTTTGGAAGTACTCAATCTGAAAGATGTAGAAGAAGGAGATGAGAAATTTGAA 

Mill IIIIMIIMMIIIMIIIIMIIIM I llllllll IMM MM Mill I 

SBDSP aaaggttctttggaagtactcaatctgaaagattt-gaagaaggagatgagaaatttgaa 

i nun mini mi mm n n n nm inniii mill 

MUSBDS AGGGGTTCTCTGGAAGTGCTCAGTCTGAAGGACGTGGAGGAAGGCGATGAGAAGTTTGAA 
~R G S L 1 V L S L K D V E E G D E K F E~ 



SBDS tgacacccatcaatctcttcacctctaaaacactaaagtgtttccgtttccgacggcact 

mniimn mnmiimmmnnimm imni n inn 

SBDSP tgacacccatcagtctcttcacctctaaaacactaaagtgttttcgtttccaacagcact 
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mini i nun i n i i i i i i i 

MUSBDS TGAcaccgcccggctcctcaactggagcacgaccgaggacgcttgttcctcacagcagca 



SBDS 



SBDSP 



MUSBDS 



gtttcatgtctgtggtctgccaaatacttgcttaaactatttgacattttctatctttgt 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 [ M 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIMIIMIIIIIIIIIIIII 

gtttcatgtctgtggtctgccaaatacttgctcaaactatttgacattttctatctttgt 

i i i n i I ii i in i i in 

gctcgttctgtgacctgccaaacgccctgctcacgcgacgtgccactttccatcttgtgt 



SBDS 



SBDSP 



MUSBDS 



gttaacagtggacacagcaaggctttcctacataagtataataatgtgggaatgatttgg 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIM 

gttaacagtggacacagcaaggctttcctacataagtataataatgtgggaatgatttgg 

i I n i i i i I I i i- i 

taaacatttacccaggtacctgggtatttttgttgtcaattggggtttccagcaaaaatg 



SBDS 



SBDSP 



MUSBDS 



ttttaattataaactcrcrqqtctaaatcctaaaqcaaaattqaaactcc aaqatgcaaacrt 

1 1 MM! IIMIllll I IIIMIMII Ml IIMMII! IMIII Mil llllllll I 

ttttaattataaactggggtctaaatcctaaagcaaaattgaaactccaggatgcaaaat 

i n ii. 

aaaaataacctaaaatacagagtccagaacagctgctcactgctgcgtctgcctttctag 



Primers L/R (RTSDCR95R/SDCR9x5BR) 
SBDS ccagagtggcattttgctactctgtctcatgccttgatagctttccaaaatgaaagttac 

imiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiii 

SBDSP ccagagtggcattttgctactctgtctcatgccttgatagctttccaaaatgaaagttac 

n ii i i i i i i i 

MUSBDS ttccaggggaccagagacagcattggtggataagaaggtagagttagtccatgacagatc. 



SBDS 



SBDSP 



MUSBDS 



ttgaggcagctcttgtgggtgaaaagttatttgtacagtagagtaagattattaggggta 

IIIIIIIIMIIMIIIIMIIIIIIII MMMMMMMMMMMMM MM I 

ttgaggcagctcttgtgggtgaaaagttttttgtacagtagagtaagattattaggggta 

l i n I 1 1 1 1 I I I I I I mi 

attggagaggggtctgaataacaaagggggtacgcctgctggaaagaagatggggtgttt 



SBDS 



SBDSP 



MUSBDS 



tgtctatacaacaaaagggggggtctttcctaaaaaagaaaacatatgatgcttcatttc 

lllllllll llllll 1 1 1 1 M 1 1 1 1 M M 1 1 ! 1 1 1 IIIIIIIIIIIIIM 

tgtctatacgacaaaa-ggggggtctttcctaaaaaagaaaac--atgatgcttcatttc 

i ii ii i i i i i n 

ctgaataatgaagtgcaggtatggggtgtgagcatggagagaagagttcctgggtccctc 
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SBDS tacttaatggaacttgtgttctgagggtcattatggtatcgtaatgtaaagcttggatga 

imiiiiiiiiiiiiiiiiiiiiiiMimmiiiiiimi iiimiiiiiiim 

SBDSP tacttaatggaacttgtgttctgagggtcattatggtatcgtaatataaagcttggatga 

1 1 i ii ii in i i i 

MUSBDS ccaatagatttataatgactagggagaatttgactttctaattttcaaccaacatgctac 



SBDS tgt t cc tgat ta t c tgagaaacagata tagaaaaa t tgt gccggac - 1 1 acctt tea 

1 1 ! I ! I i i I M i 1 1 1 1 1 1 1 1 1 1 II M 1 1 J 1 M 1 1 1 1 1 1 1 1 Mill I II llll 

SBDSP tgttcctgattatctgagaaacagatatagaaaaattgtgtcggacttaaataattttcg 

1 1 1 1 i " i i i i i i i ini 

MUSBDS caaaactgacttagattattcttgggaaaatatatacagtcatttaatactaattcttaa 



SBDS ttgaacatgctgccataacttagattattcttggttaaaaaataaaagtcacttatttct 

Ml llll Mill llll II III II II I llllll Mill llll Mill III I llll II III 

SBDSP * ttgaacatgctgccataacttagattattcttggttaaaaaataaaagtcacttatttct 

llll II I.I II II II I I Ml 

MUSBDS aggtttataatatatgttagtatagttaaaattctatgtaatcaataaaacttattttta 



site) 
SBDS 
SBDSP 
MUSBDS 



(polyadenylation 



aattcttaaagtttataatatatattaatatagctaaaattgtatgtaatcaataaaacc 

1 1 1 J M !! 1 1 1 1 ! 1 1 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 1 1 f 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

aattcttaaagtttataatatatattaatatagctaaaattgtatgtaatcaataaaacc 



(end of human transcript, raRNA of 1605nt) 

i 

•SBDS actcttatgtttattaaactatggcttgtgtttctagacaacttcctaactccctttctt 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 } 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 f 1 1 E 1 1 )! 1 1 i 1 1 1 1 Ml 

SBDSP actcttatgtttattaaactatggcttgtgtttctagacaacttcctaactccctttctt 
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